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IMPROVED EPITOPE OISPIAYINS PHASE 



This invention relates to the expression and display 
of libraries of mutated epitopic peptides or potential, 
biasing protein domains on the surface of phage, and the 
screening of those libraries to identify high affinity 



The amino acid sequence of a protein determines its 
three-dimensional C3D> structure, which in turn 
determines protein function. Some residues on the 
polypeptide chain are more issportant than others in 
is determining the 3D structure of a protein. Substitutions 
of amino acids that are exposed to solvent are less 
lifeely to affect the 313 structure than are substitutions 
at internal loci. 

« Protein engineering* is the art of manipulating the 
of a protein in order to alter its binding 
ciracteristies. The factors affecting protein binding 
are known, hut designing new compleraentary surfaces has 
proven difficult. 

With the development of recombinant Dia techniques, 
2S became possible to obtain a mutant protein toy mutating 

the gene encoding the native protein and then expressing 
the mutated gene. Several mutagenesis strategies are 
known > One, "protein surgery % involves the introduction 
of one or more fi£S^3^QBl£^ stations within the gene of 
30 choice. & atals polypeptide of completely predetermined 
sequence is expressed, and its binding characteristics 
ere evaluated. 

At the other extreme is random mat agenesis toy. means 
of relatively nonspecif ie mutagens such as radiation and 
35 varioi 



St is possible to randomly vary predetermined 
nucleotides using a mixture of teases in the appropriate 
cycles of a nucleic acid synthesis procedure. The 
proportion of bases in the mixture, for each position of 
S a eodon, will determine the frequency at which each amino 
acid will occur in the polypeptides expressed from the 
degenerate BNA popxaation, Oliphant et. aX_ (CLIPS 6} and. 
Oliphant and SfcruhX (0LIP8?) jfeave- demonstrated ligation 
and cloning of highly degenerate oligonucleotides, which 
10 were need in the imitation of promoters . They suggested 
that similar methods could toe used in the variation of 
protein coding regions . They do not say how one should ; 
a) choose protein residues to vary, or b) select or 
screen mutants with desirable properties. Raidhaar-Oisou 
IS and Sailer {£BXBSaa5 have used synthetic degenerate oligo- 
nits to vary simultaneously two or three residues through 
all twenty arsino acids. See also i?erehon ^fe i^. 
{VERSSSa; vms&Sb} „ Reidhaar- Olson and Sauar do not 
discuss the limits on how many residues could be varied 
20 at once nor do they mention the problem of unequal 
abundance of Dm encoding different amino acids. 

A number of researchers have directed Mnmut&tf.d 
foreign antigenic epitopes to the surface of phage, fused 
to a native phage surface protein, and demonstrated that 
2S the epitopes were recognised by antibodies, 

Dnlfasceo (DTJLS8S) suggests a procedure for incor- 
porating a foreign antigenic epitope into a viral surface 
protein eo that the expressed chimeric protein is dis- 
played on the surface of the virus in a manner such that 
30 the foreign epitope is accessible to antibody. In 19 85 
Smith (SMIT8S) reported inserting a m s ac 
of the SaoS.1 endonuclease gene into gene III of bacterio- 
phage fx, B in phase* ♦ The gene 111 protein is a minor 
coat protein necessary for infeetivity. Smith demons - 
35 trated that the reeoxaoinant phage were adsorbed by 
isssoMiiKBd antioody raised against the lec&l endonucle- 
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ase, and could toe ©luted with acid. Be Xa crass gj; sl^ 
{PSL&88} have expressed a fragment of the repeat region 
of the eireumeporoaoite protein from 
. falciparum on the stirface of MX3 as as insert in the gene 
5 ill protein. They showed that the recomh inane phage ware 
both antigenic and immunogenic in rabbits, and that such 
recombinant phage could fee used for B epitope mapping. 
The researchers suggest that similar recombinant phage 
could be used for T epitope mapping and for vaccine 

10 development. 

MoCaf f erty a«U <WCC»SQ} expressed a fusion of an 
Fv fragment of an antibody to the »- terminal of the pi XX 
protein. Tim Fv fragment was not mutated. 

Ladasr, Click, and Bird, WO88/0S630 CpabX, 7 Sept. 

IS 1988 and having priority from PS application 07/021 , 046, 
assigned to Geaax Corp,} (LQB) speculate that diverse 
single chain antibody domains (SC&D) may he screened for 
binding to a particular antigen toy varying the » 
encoding the combining determining regions of a single 

20 chain antibody, sutocXoning the SC&D gene into the gpV 
gene of phage lambda so that a SCRD/gpV chimera is 
displayed on the outer surface of phage lambda, and 
selecting phage which bind to the antigen through 
af f ini ty chromatography , 

25 Parmley and Smith {PAKK88} suggested that an epitope 

library that exhibits all possible he^apeptidea could foe 
constructed and used to isolate epitopes that bind to 
antibodies. In discussing the epitope library, the 
authors did not suggest that it was desirable to balance 

30 the representation of different amino acids. Box did 
they teach that the insert should encode a complete 
domain of the exogenous protein. Epitopes are considered 
to toe unstructured peptides as opposed to structured 
proteins. Scott and Smith (SCOTS 0} and Cwlria ex al^ 

3S CCWXES05 prepared » epitope libraries* in which potential 
hexapaptide epitopes for a target antibody were randomly 
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mafcatad fey timing degenerate oligosacleofcides , encoding 
tfas epitopes, witfe gene III of fd phage, and expressing 
the fused gene in phage- infected cells. The cells 
manufactured fusion phage which displayed the epitopes on 
5 .-their surface? the phage which bound to immobilized 
antibody ware elated with acid arid studied. Devlin 
^ Wmmm similarly screened., using MX3 phage, for 
random IS residue epitopes recognised by strsptavidin. 
The Scott and Smith, C*?irla *Ll> •ad Devlin 

10 &L« libraries provided a highly biased sampling of the 
possible amino acids at each position. Their primary 
concern la designing the degenerate oligonucleotide 
encoding their variable region was to ensure that all 
twenty amino acids were encoditole at each position i a 

15 secondary consideration was minimising the frequency of 
Occurrence, of atop signals, consequently, Scott and 
Smith and Cwiria al^ employed HHK < Unequal mixture of 
G, A, T> Cj &~equal mixture of 0 and T) while Devlin ej£. 
used m& IS^equ&l mixture of 6 and C} . There was no 

20 attempt to minimize the frequency ratio of most favored- 
to- least favored amino acid, or to equalize the rate of 
occurrence of acidic and basic amino acids, 

Devlin characterized several affinity- 

selected streptavidin- binding peptides, tout did not 

25 measure the affinity constants for these peptides, 
cwirla s& did determine the affinity constant for his 
peptides, tout were disappointed to find that his best 
bexapepfcid.es had affinities £3S0-300nH} , "orders of 
magnitude* weaker than that of the native Met- enkephalin 

30 epitope nm) recognized toy the target antibody, Cwirla 
at. al,.. speculated that phage bearing peptides with higher 
affinities remained boxing under acidic elation, possibly 
because of multivalent interactions between phage (carry- 
ing about 4 copies of pXXX) and the divalent target XgG. 

35 Scott and Smith were able to find peptides whose affinity 
for the target antibody (A2) was comparable to that of 
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the reference myoheinarythria epitope <S0*sM) . Howsvar, 
Scott and Smith likewise pressed concern that some 
high-affinity peptides were lost, possibly through 
irreversible bi»<ti»g of fusion phage to target, 

S ha&aer, et al, m$8fQ2BQ9 ! incorporated by reference 

herein, describe a process for the generation and 
idantif ieation of novel binding proteins having affinity 
for a predetermined target. In this process, a gene 
encoding a potential binding domain <m distinct from a 

1 : S mere epitopic peptide) , said gene being obtained by 
random mutagenesis of a limited number of predetermined 
cottons, is fused to a genetic element which causes the 
resulting chimeric expression product to be displayed on 
the enter surface of a virus (especially a filamentous 

15 phage) or a cell . Chroaiatographic selection is than used 
to identify viruses or cells whose genome includes such 
a fused gens which coded for the protein which bound to 
the chromatographic target. Ladaer, at &X» discusss 
several methods of recovering the gene of interest when 

20 the viruses or cells is so tightly bound to the target 
that it cannot be washed off in viable form. These are 

growing them situ on the chromatographic matrix, 

fragmenting the matrix and using it as an inoculant into 
a culture vessel, degrading the linkage between the 

25 matrix and the target material , and degrading the viruses 
or cells but then recovering their BNA. However, these 
methods will also recover viruses or calls which are 
nonspecifically bound to the target material, 

WO30/02809 also addressed strategies for 

30 mutagenesis, including one which provides ail twenty 
amino acids in substantially ee^sal proportions f but only 
in the context of mutagenesis of protein domains, not 
epitopic peptides . 



35 



The present invention is intended to overcome the 
deficiencies discussed above* X» one etafoodimenc of the 
ia«atioa, & library of "display phage* is used to 
identify binding domains with a high affinity for a 
5 predetermined target. Potential binding domains are 
displayed on the surface of the phage. This is achieved 
by expressing a fused gene which encodes a chimeric outer 
surface protein comprising the potential binding domain 
and at least a functional portion of a coat protein 

10 native to the phage. The preferred embodiment uses a 
pattern of semirandom mutagenesis, called « variegation®, 
that focuses mutations into those residues of a parental 
binding domain that are most likely to affect its binding 
properties and are least likely to destroy Its underlying 

15 structure. As a result, while any one phage displays 
only a single foreign binding domain (though possibly in 
multiple copies) , the phage library collectively displays 
thousands, even millions, of different binding domains. 
The phage library, is screened by affinity separation 

20 techniques to identify those phage bearing successful, 
(high affinity) binding domains, and these phage are 
recovered and characterised to determine the sequence of 
the successful binding domains. These successful binding 
domains my then serve as the parental binding domains 

as for another round of variegation and affinity separation, 
Xn another embodiment of the invention, the display 
phage display on their surface a chimeric outer surface 
protein comprising a functional portion of a native outer 
surface protein and a potential epitope. In an epitope 

30 library made of. these display phage, the region 
„cort - ag to the foreign epitope is hypervariabie . 

The library is screened with an antibody or other binding 
protein of interest and high affinity epitopes are 
identified. References to display, mutagenesis and 

35 screening of potential binding domains should be taken to 
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apply , mi&&tlg.....mu»^&% to display, mutagenesis and 
screening of potential epitopes, unless stated otherwise. 
As previously mentioned, when severs! copies of the 
chimeric coat protein are displayed on a single phage, 
3 there is a risk that irreversible binding will occur, 
especially if the target is multivalent fas with an 
antibodyK Xn this case, the phage last elntad by an 
elutioa gradient will not fee the ones bearing the highest 
affinity epitopes or binding domains, but rather will foe 

10 those having an affinity high enough to hold on to the 
target under the initial elution conditions hut hot so 
high as to bind irreversibly. As a result, the 
methodology known in the art amy tail to recover very 
high affinity epitopes or- binding domains f which for many 

IS purposes are the most desirable species.. 

We propose to cope with the problem of irreversible 
binding by incorporating into the chimeric coafc protein 
a linker sequence, between the foreign epitope or- binding 
domain, and the sequence native to the wild- type phage 

20 coat protein, which is cleavafole by a site-specific 
protease. In this case, the phage library is incubated 
with the immobilized target. Lower affinity phage are 
eluted off the target and only the solid phase (bearing 
the high affinity phage) is retained. The aforementioned 

25 linker sequence is cleaved, and the phage particles are 
released, leaving the bound epitope or binding domain 
behind. One way then recover the particles {and sequence 
their DH& to determine the sequence of the corresponding 
epitope or binding domain! or the bound peptide (and 

30 sequence its amino acids directly) - The former recovery 
.method is preferred, as the encoding may be amplified 
in vitro using PGR or in vivo by transfecting stii table 
host cells with the high affinity display phage, hliiie 
the production of fusion proteins with tieavsbie linkers 

:3S is known in the art, the use of such linkers to 
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facilitate controlled cleavage of a c&iateric coat protein 
of a phage has not previously been reported, 

toother method of addressing irreversible binding is 
appropriate when the binding domains are "mini-proteins*, 
5 i.e., relatively snail peptides whose stability of 
structure primarily attributable to the presence of one 
or more covalent crosslinks, e.g., disulfide bonds. As 
in the example above, low affinity phage are removed 
first. The remaining, high affinity, bound phage are 

10 then treated with a reagent which breaks the crosslink, 
such as dithiothreitol in the ease of a dosmin with 
disulfide bonds, but does not cleave peptidyl bonds or 
modify the side chains of amino acids which are not 
crosslinks^*- This will usually result in sufficient 

15 denaturation to either release the phage outright or to 
permit their elution by other means. 

These two methods, of course, are not mutually 
exclusive < 

In the previously known epitope display phage 

20 libraries, the phage genome was altered fey r^gl&sisg the 
gene encoding the wild-type gene 111 protein of M13 with 
one encoding a chimeric coat protein. &s a result, the 
five normal copies of the wild type gens 111 protein were 
ail replaced by the chimeric coat protein, whereby each 

25 phage had five potential binding sites for the target, 
and hence a very high potential avidity. With high 
affinity epitopes (or binding domains) , this might well 
contribute to irreversible binding. 

One method of the present invention of reducing the 

30 avidity of display phage, especially epitope display 
, phage , for their target, and hence of alleviating the 
problem of irreversible binding, is to engineer the phage 
to contain £wo genes that each express a coat protein, 
one encoding the wild: type coat protein, and the other 

3S the cognate chimeric coat protein. Thus, phage bearing 
identical epitopes or binding domains amy yet bear 



WO 92/13679 



different ratios of wild type to sMsgric coat protein 
molecules, sua hence have different avidities . *he 
average ratio for the library will be dependent on the 
relative levels of expression of the two cognate genes , 
S it may fee advantageous to be able to modulate the 

ratio or the chimeric coat protein to its cognate wild- 
type coat, protein. For example, early in the 
evolutionary process, the affinity of the blading Atomism 
for their target may be rather low, especially if they 

10 are based on a parental binding domain which has no 
affinity for the target. 

Modulation may fee achieved fey placing the chimeric 
gen® under the control of a r^auli^hl^ promoter, 

Khile it may fee possible to place the cognate wild- 

IS type gene under the control of a second, differently 
regulated promoter, this my he impracticable if, as with 
the M13 genaXIX, the gene is part of a polycistronic 
operonl In this case, expression of the wild- type gene 
may be reduced by replacing its methionine initiation 

20 codon with a leucine initiation codoh. 

MX3 gene III, as. previously noted., encodes one of 
the minor coat proteins of this filamentous phage (five 
copies per phage) . In view of the difficulties with 
irreversible binding reported fey those modifying this 

25 gene so that a foreign epitope is displayed on the phage 
coat, nee of the UX3 gmjL&r c oat protein was clearly 
discouraged. However, we have found that chimeric major 
coat proteins are in fact useful for displaying potential 
fcindtng domains for screening purposes even though 

30 I indeed f somet imes because) there are over a thousand 
copies of this protein per phage. It is believed that 
the major (vIII) coat protein would likewise fee useful in 
constructing an epitope phage library. 

Me have also developed a linker suitable for 

35 attaching potential binding domains {or epitoplc 
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peptides) to this m$ow coat protein, and parh&ps to 
other proteins as wall „ 

Finally, to the eadbsaat that some of the problems 
experienced with epitope libraries have been attributable 
5 to the us® of patterns of mutagenesis which lead to 
highly biased allocations of amino acids, the present 
invention is also directed to a variety of improved 
patterns that lead to less biased and hence more 
efficient epitope phage libraries. 

10 

Figure 1 shows how a phage may be used as a genetic 
phage. At (a) we have a wild- type precoat protein lodged 
in the lipid biiayer. The signal peptide is in the 

IS .periplastic, space * At (b) , a chimeric precoat protein, 
with a potential binding domain interposed between the 
signal peptide and the mature coat protein sequence, is 
similarly trapped. At Co) and {&} } the signal peptide 
has been cleaved off the wild- type and chimeric proteins, 

20 respectively, but certain residues of the coat protein 
sequence interact with the lipid bilayer to prevent, the 
mature protein from passing entirely into the periplasm, 
At is) and (f } , mature wild- type and chimeric protein, are 
assembled into the coat of a single stranded phage as 

25 it emerges into the periplastic space. The phage will 
pass through the outer membrane into the medium where it 
can be recovered and chromatographi cal ly evaluated. 
Figure 2 shows the C£ of the coat protein of phage fx. 

30 DETAILED PE$CTO%Ii^^ 

x. tiwmm stsubhwk 

present invention, contemplates that a potential 
binding domain (pbd) or a potential epitope will be dis- 
ss played on the surface of a phage in the form of a fusion 
with a coat {outer surface) protein COSP) of the phage. 
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This chimeric outer surface protein is the processed 
product of the polypeptide expressed by as display gene 
inserted into the phage genome? therefore s 1} toe genome 
of the phage must allow introduction of the display gene 
5 either by tolerating additional genetic material or by 
having replaceable genetic material $ 2} the virion must 
be capable of packaging the genome after accepting the 
insertion or substitution of genetic material, and 3) the 
display of the OSP-ZPBC protein on the phage surface must 

10 not disrupt virion, structure sufficiently to interfere 
with phage propagation. 

mien the viral particle is assembled, its coat 
proteins may attach themselves to the phages a) from the 
cytoplasm, h) from the periplasm, or c) from within the 

IS lipid Mlayer, The immediate expression product of the 
display gene must feature,- at its amino terminal, a 
functional secretion signal peptide, such as the shaft 
f ignal {mQBTiMMtmmwrw^Tsm , if the coat protein 
attaches to the phage from the periplasm or from within 

.20 the lipid bilayer. If a secretion signal is necessary 
for the display of the potential binding domain, in an 
especially preferred embodiment the bacterial cell in 
which the hybrid gene is expressed is of a 8 secret ion- 
pBrmiseive* strain, 

25 The mt& sequence encoding the foreign epitope or 

binding domain should precede the sequence encoding the 
coat protein proper if the amino terminal of the 
processed coat protein is normally its free end, and 
should follow it if the carboxy terminal is the normal 

30 free end. 

The morphogenetic pathway of the phage determines 
the environment in which the IPBD will have opportunity 
to fold. Periplasm! cally assembled phage are preferred 
when XPSDs contain essential disulfides, as such IPBDs 

3S may not fold within a ceil (these proteins may fold after 
the phage is released from the cell) , Intracellular!:/ 
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assembled phage ara preferred mhan the IFBC needs large 
or insoluble prosthetic groups Csuch as Fe 4 S 4 clusters} f 
since the XPBD aiay not fold if -secreted because the 
prosthetic group is iaeking. 
5 When, variegation is introduced, multiple infections 

could generate hybrid GPs that carry the gene for one SBD 
but have at least sane copies of a different PBD on their 
surfaces? it is preferable to minimise this possibility 
by infecting cells with phage under conditions resulting 

10 in. a low multiple -of -infection {MOD . 

For a given bacteriophage, the preferred OSF. ie 
usually one that Is present on the phage surface in the 
largest number of copies, as this allows the greatest 
flexibility in varying the ratio of OSF-IPSP to wild type 

15 OSF and also gives the highest likelihood of obtaining 
satisfactory affinity separation. Moreover, a protein 
present In, only one or a few copies usually performs an 
essential function in morphogenesis or infection? 
mutating such a protein by addition or insertion is 

as .likely to result in reduction in viability of the <3$. 
Nevertheless, an OSF suet as M13 gill protein may be an 
excellent choice as OSP to cause display of the PBD, 

It is preferred that the wild- type oap. gene be pre- 
served. The jphd geua fragment may be inserted either 

25 into a second copy of the recipient sm& gene or into a 
novel engineered gene. It is preferred that the p§p-„ 
AbM gene be placed under control of a regulated 
promoter. Our process forces the evolution of the PEDs 
derived f rom XPBD so that some of them develop a novel 

30 function., -viz. binding to a chosen target. Placing the 
gene that is subject to evolution on a duplicate gene is 
an imitation of the widely •• accepted scenario for the 
evolution of protein families. It is now generally 
accepted that gene duplication is the first step in the 

35 evolution of a protein family from an ancestral protein. 
By having two copies of a gene, the affected 
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physiological process can tolerate mutations in one of 
the genfes. Tills process is -well understood aad 
documented for the globin family (.ClL. DXCK83 , pSSSf, and 
CRBXS4, plX7-125) . 
S The user must choose a sit® in the candidate O&P 

gene for inserting a 4pM gene fragment. The coats of 
most bacteriophage are highly ordered Filamentous phage 
can he described by a helical lattice; isometric phage, 
by an icosahedral lattice. Each monomer of each major 

10 coat protein aits on a lattice point and xaakes defined 
interact ions with each of its neighbors . Proteins that 
fit into the lattice by making some, but not all, of the 
noxml lattice contacts are likely to destabilise the 
virion toys a) aborting formation of the virion, to} making 

IS the virion unstable, or e) leaving gaps in the virion so 
that the nucleic acid is not protected. Thus in 
bacteriophage, it is important to retain in engineered 
OSP-JPSB fusion proteins those residues of the parental 
OSP that interact with other proteins in the virion. For 

20 M13 gVIXX, we prefer to retain the entire mature protein, 
while for M13 gill,- it might suffice to retain the lest 
100 residues (or even fewer) , Such a truncated gxxx 
protein would be expressed in parallel with the collets 
gXI! protein, as gXXT protein is reguired for phage 

25 inf activity. 

The display gene is placed downstream of a known 
promoter, preferably a regulated promoter such as JUsSBES, 
tat >- or 

l.w>£llia^n£ms.,,.i ! Mge^ 

30 The filamentous phages, which include Ml 3, fl s fd, 

Ifl, Ike, Xt, Ml, and Pf3, are of particular interest. 
The entire life cycle of the filamentous phage Ml 3 , a 
common cloning and sequencing vector., is well understood. 
The genetic structure ISCH&78) of MX3 is well known as is 

35 the physical structure of the virion B0BK80, 
CERTO, I&P&Tg, mHBBSh, XUBH87, K&XQ8Q, M&EV78 , 
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MESS78, 0HKA81, RASC86, RUS8S1, SCK&78 , SMXTSS, W8BS7S, 
and 23MM82) ? see MSCIS for a recent review of the 
structure ami function of the coat proteins. 

Marvin and collaborators {MRHV7&, MMC08G, BSHHSi) 
jfjavs determined an approximate 3D virion structure of Use 
closely related phage fl by a eocibination of genetics, 
biochemistry-' and %-x&y diffraction from fibers of the 
virus. Figure 2 is draws after the model of Banner .s£ 
al. mwmsi) and shows only the C w s of the protein. The 
apparent holes in the cylindrical sheath are actually 
filled by protein side groups so that the Dm within is 
protected. The amino terminus of each protein monomer is 
to the outside of the cylinder, while the carbosiy 
terminus is at smaller radius, near the J>HA. Although 
other filamentous phages f& t ,g, t Ff X or lice) have different 
helical symmetry, all have coats composed of may short 
«-heiital monomers with the amino terminus of each 
monomer on the virion surface. 

3.. M13 miox Coat Protein (gVIII) 

The major coat protein of M13 is encoded by gene 
¥111, The SO amino acid mature gene YT.II coat protein is 
synthesized as a 73 ami.no acid precoat CSCH&78; XT0K7S) . 
The first 23 amino acids constitute a typical signal- 
sequence which causes the nascent polypeptide to he 
inserted into the inner cell membrane. Whether the 
precoat inserts into the membrane by itself or through 
the action of host secretion components., such as Sec& and 
SecY, remains controversial, but has no effect on the 
operation of the present invention , 

Jui &gli signal peptidase tSV-1) recognises amino 
acids 18, 21, end 23, and, to a lesser extent, residue 
22 f and cuts between residues 23 and 24 of the precoat 
(KOHNSSa, KMfSSb, O&Xmi}* after removal of the signal 
sequence, the amino terMnus of the mature coat is 
located on the periplastic side of the inner membrane i 
the carfooxy terminus is on the cytoplasmic side. About 
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3D0G copies of the mature SO amino acid coat protein 
associate side-by- side in the inner siembrane. 

We have constructed a tripartite gene comprising; 
1} DBA. encoding a signal sequence directing secretion 
5 of parts 12) and (3) through the inner membrane, 

2} DBA encoding the mature SPtI sequence, and 
3} BM& encoding the mature M13 gVIII protein. 
This gene causes BPTZ to appear in active form on the 
surface of Ml 3 phage, 
10 2. H±3 Minor Coat Proteins, Generally 

An introduced bxndi ,ng domain or epitope way also foe 
displayed on a f iiamentoue phage as a portion of a 
chimeric minor coat protein. These are encoded by genes 
121 f VI, ¥11, and IX, and. each is present in about S 
IB copies per virion and .is related to morphogenesis or 
infection. In contrast, the major coat protein is 
present in more than 2500 copies per virion. The gene 
XIX, VX, VXX, and XX proteins are present at the ends of 
the virion, 

20 3, The M13 gill Minor Coat Protein 

The single -stranded circular phage DHA associates with 
about five copies of the gene XXI protein and is then 
extruded through the patch of membrane -associated coat 
protein in such a way that the DM is encased in a 

25 helical sheath of protein (WBBS78J . The DH& does not 
base pair (that -would impose severe restrict ions on the 
virus genome) ; rather the bases intercalate with each 
other independent of sequence* 

Smith CSMXT3S) and de la Cms jgfc &L, (BBXA88) have 

30 shown that insertions into gene XIX cause novel protein 
domains to appear on the virion outer surface. The mini- 
protein's gene may toe fused to gene ZI1 at the site used 
by Smifcn and by do la Cxuz aX^, at a codon correspond- 
ing to another domain boundary or to a surface loop of 

35 the protein, or to the amino terminus of the mature 
protein. 
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jkll published works use & sector containing a single 
modified gene XII of Sd* Thus, mil five copies of gill 
are identically modified. Sea® XXI is quite large {1272 
b.p. or about 20% of the phage genome) and it is 
5 uncertain whether a duplicate of the whole gene can he 
stably inserted into the phage, Furthermore, all £iw. 
copies of gill protein are at one end of the virion. 
When bivalent target molecules (such as antibodies) bind 
a pentavalent phage, the resulting complex my he 
10 irreversible, irreversible binding of the phage to the 
target greatly interferes with affinity enrichment of the 
SFs that carry the genetic sequences encoding the novel 
polypeptide having the highest affinity for the target. 
To reduce the likelihood of Sensation of 
15 irreversible complexes, we saay use a second, synthetic 
gene that encodes only the oarboxy- terminal portion of 
. ptX> We might, for example, engineer a gene that 
comprises (from 5' to 3 5 }j 

1} a promoter (preferably regulated) , 
20 2} a ribosome -binding site, 
3} an initiation eodon,. 

4} a sequence encoding a functional signal peptide 
directing secretion of parts <s) and (6) through the 
inner membrane, 
25 5} D8& encoding a potential binding domain, 

€} 3WA encoding residues 27S through 424 of M13 gill 
protein, 

7} a translation stop eodon, arid 
S) (optionally) a transcription stop signal. 
30 Note that in the gXXX protein, the amino terminal 

moiety is responsible for pilus binding (i.e., for 
infectdvity) and the carboxy terminal moiety for 
packaging, so that the chimeric gXXX protein described 
above is able to assemble into the viral coat, but does 
35 not contribute, to inf activity. 



WO 92/13679 



KT/IJS92/0133W 



17 

We leave the wild- type gene XII so that soma 
unaltered gene III protein will fee present - 

Thus, the hybrid gene amy casgaris.® im ^coding a. 
potential binding domain operably linked to a signal 
S sequence (e.g. . the signal sequences of the bacterial 
phoA or bla genes or the signal sequence of HIS phage 
S&S&XXX) and to OT& encoding at least a functional 
portion of a coat protein Ce,g. , the M13 gene III or gene 
¥11:1 proteins} of a filamentous phage MIS) - The 

10 expression product is transported to the inner mmfermm 
(lipid bil&yar) of the host cell, whereupon the signal 
peptide is cleaved off to leave a processed hybrid 
protein. The C- terminus of the coat protein- like com* 
ponent of this hybrid protein is trapped in the lipid 

IS bilayar, so that the hybrid protein does not escape into 
the periplastic space. (This is typical of the wild- type 
coat protein,} As the single -stranded tmh of the nascent 
phage particle passe® tnpo the periplaamic space, it 
collects both wild- type coat protein and the hybrid 

20 protein from the lipid bilayer, The hybrid protein is 
thus phaged into the surface sheath of the filamentous 
phage, leaving the potential binding dentin exposed on 
its outer surface. 

4, Coat Proteins of Ff3 

25 Similar constructions could be made with other 

filamentous phage. PM$ is a well known filamentous phage 
that infects pseudomonas cells that harbor an 

inel'-i plasmld. The entire genome has been sequenced 
(LUXTSS) and the genetic signals involved in replication 

30 and assembly are known (LUIT87) . The major coat protein 
of PF3 is unusual in having no signal peptide to direct 
its secretion, The sequence has charged residues ASP- ? , 
t#S«, and PHSE«-C0Cr which is consistent with the 
amino terminus being exposed. Thus, to cause an to 

33 appear on the surface of Pf3, we construct a tripartite 
gene comprising;; 
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1) 

aeruaeTiosa Cpref erably k 
XFBD) fused in- frame to, 

25 

5 in- frame to, 

3} BM encoding the mature Pf3 coat protein, 
Optionally, BE encoding a flexible linker of one to 10 
amino acids is introduced between the gsna fragment 

and tlie Pf3 m^mfeein optionally, m& encoding 

10 the recognition site for a specific protease, such ass 
tissue plasminogen activator or blood clotting Factor M&, 
is introduced between the IpM gene fragment and the Pf 3 
coat -protein gene, .&min© acids that forts the recognition 
site for a specific protease may also serve the function 
IS of a flexible linker, This tripartite gene is introduced 
into P£3 so that it does not interfere with expression of 
any 3?f3 genes. To reduce the possibility of genetic 
recombination, part (3) is designed to have numerous 
silent mutations relative to the wild -type gene. Once 
is cleaved off, the XPBD is in the 



ly signal. It does not matter that this 
feeion protein comes to rest in the lipid bilayer by a 
route different from the route followed by the wild- type 
coat protein. 

The structural coding sequence of the display gene 
encodes a chimeric coat protein and any required 
secretion signal, & ^chimeric coat protein" is a fusion 
of a first amino acid sequence (essentially corresponding 
to at least a functional portion of a phage coat protein} 
with a second aarino acid eeguenca, e.g., a domain foreign 
to and not substantially homologous with any domain of 
the first protein. A chimeric protein may present a 
foreign domain which is found (albeit in a different 
protein) in an organism which also expresses the first 
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prateia, or it may be an * interspecies" , *iaterge;aeric«% 
etc. fusion of protein structures- expressed by different 
kind© of organisms, The foreign domain may appear at the 
amino or carboxy terminal of the first amino acid 
5 sequence {with or without an intervening spacer ) , or it 
may interrupt the first amino acil sequence. The first 
amino acid sequence may correspond exactly to a surface 
protein of the phage, or it may be modified, to 
facilitate the display of the binding domain. 

10 A preferred, site for insertion of the ip hd gene into 

the phage osp gene is one in which s a) the XPBD folds 
into its original shape, b} the OSP domains fold into 
their original shapes, and c| there is no interference 
between the two domains. 

IS If there is a model of the phage that indicates that 

either the amino or earboscy terminus of an OSP is exposed 
to solvent , then the esspossd terminus of that mature OSP 
becomes the prime candidate for insertion of the IjObfl 
gene. A low resolution m model suffices, 

20 in the absence of a 3D structure, the wsd.no and 

c&rbdxy termini of the mature OSP are the best candidates 
for insertion of the iphd gene. A functional fusion may 
require additional residues between the XPBB and OSP 
domains to avoid unwanted interactions between the 

25 domains. Random- sequence B-Kh or BMA coding for a 
specific sequence of a protein homologous to the IPBD or 
OSP, can be inserted between the pgp fragment and the 
IpM fragment if needed. 

Fusion at a domain boundary within the OSP is also 

30 a, good approach for obtaining a functional fusion. Smith 
exploited such a boundary when suboloning heterologous 
Dffi into gene SI of 11 CSMXTS5K 

The criteria for identifying OSP domains suitable 
for causing display of an XPBB are somewhat different 

35 from those used to identify and XPBD, When identifying 
an OSP, minimal size is not so important because the OSP 
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domain will net appear in the f inal blading jaolecoXe nor 
will, we need to syntfcfcsisse the gene repeatedly ia each 
variegation round. The major Assign concerns are that; 
a) the OSF;sXPBD fusion causes display of XP8D, h) the 
initial genetic construction be reasonably convenient, 
be genetically stable and 
methods of 
rely on atomic 
by Janin and Chechia 
of distances 
(Cj t dividing planes BOSB85) , or 

,) , Chothia and collaborators have 
correlated Che behavior of mmy natural proteins with 
domain structure {according to their definition) . Eashin 
15 correctly .predicted the stability of a domain comprising 
residues 206-316 of thermolysis , 

Many researchers have used partial proteolysis and 
protein sequence analysis to isolate and identify stable 
domains, (See, for example, VXT&84, POTB83, SCOTS 7%, and 
20 53&B079.) Pabo used c&lorimstry as an indicator 

that the cl repressor from the eoliphage } \ contains two 
domains? they then used partial proteolysis to determine 
the location of the domain boundary. 

If the only structural information available is the 
25 amino acid sequence of the candidate OSP, we can use the 
sequence to predict turns and loops. There is a high 
probability that some of the loops and turns will be 
correctly predicted <cf». Cbou and Fssmaa, (CHCU74) ) ? 
these locations are also candidates for insertion of the 
30 ipM gene fragment. 

Fusing one or more new domains to a protein may make 
the ability of the new protein to he ex i rem the 

cell different from the ability of the parental protein. 
The signal peptide of the wild- type coat protein may 
35 function for authentic polypeptide hut he unable to 
direct escort of a fusion, To utilise the Sec- dependent 



WO 92/15679 



PCXAJS92/01539 



pathway, oca stay need a different signal peptide. Tiros, 
to express end display a chimeric BP7X/M13 gene YIIX 
protein, we found it mcmmry to utilise a heterologous 
signal peptide {that of phoh) . 
S Phage that display peptides having high affinity for 

the target »y he quite difficult to eiute from the 
target., particularly a multivalent target. One can 
introduce a cleavage site tor a specific protease, such 
as -blood- clotting Factor Xa, into the fusion protein so 

10 that the binding domain tan he cleaved from the genetic 
package. Such cleavage has the advantage that all 
resulting phage have identical coat proteins and 
therefore are equally infective, even if polypeptide- 
displaying phage can be elated from the affinity matrix 

IS without cleavage. This step allows recovery of valuable 
genes which might otherwise be lost. To our knowledge* 
no one has disclosed or suggested using a specific 
protease as a means to recover an information - containing 
genetic package or of converting a population of phage 
that vary in iaf activity into phage having identical 
inf activity. 

There exist a auabar of highly specific proteases, 
While the invention does not reside in the choice of any 
particular protease, the protease is preferably 

25 sufficiently specific so that under the cleavage 
conditions, it will cleave the linker but not any 
polypeptide essential to the viability of the phage, or 
.{save in rare cases) the potential epitope /binding 
domain. It is possible that choice of particular 

30 cleavage conditions, e.g. , low temperature, may make it 
feasible to use a protease that would otherwise be 
unsuitable, 

The blood- clotting and complementation systems 
contains a number of very specific proteases. Usually, 
3S the ensymes at the early stages of a cascade are more 
specific than are the later ones, for example, Factor % 
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{F*XJ is more specific than is thrombin (op* Table 10-2 
of - Bovine F.X* cleaves after the sequence lie- 

Glu-GXy-arf while ismm W»K cXe»yes after Xla-Asp-Giy- 
Arg. Either protease -linker pair may foe used, as 
S desired. Xf thrombin is used, the most preferred 
thrombin-sensitive Xiakers are those found in fibrinogen, 
Factor XXIX, and prothrombin. Preferably, one would take 
the linker sequence from she species from which the 
tfarearibia is obtained? for example, if bovine thrombin is 

10 to be used, then one uses a linker taken from bovine 
fibrinogen or bovine Fx XXIX. 

Human Factor XX* cleaves human Factor IX at two 
places CCOLMS7 , p. 42) : 

Q T $ I & T EmLJt S A V F and 

3.5 S F ]L$L£JLMmLS V Q S B. 

Thus one could incorporate either of these sequences 
(especially the underscored portions I as a linker between 
the PBD and the QP~ surface-anchor dossain (GPSM>) and use 
human F«XX S to cleave them> 

20 Hma&n kaXXikraln cuts human F. XXI at (COIAJS7, 

p.2S8)i 

This sequence hast significant similarity to the hF.XX* 
sites above > One could incorporate the sequence SSMTKWS 
25 as a linker between PBD and OPS&D and cleave PBD from the 
OP with human k&XXifcxein* 

Human F.XIX, cuts human F.XX at (00X4587, p. 256): 

One could incorporate KXKPRXVG as a linker between PBD 
30 and GPSKD , FED could then be cleaved from SP with 
hF,XXX s . 

Other proteases that .have been used to cleave fusion 
proteins include enterokinase,. collagenase, chymosin, 
urokinase,- renin, and certain signal peptidases. See 
35 Stutter, VB 4,765,325, 
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Whan a proteases inhibitor is sought, the target 
protease and other proteases having similar substrate 
specificity are net preferred for cleaving the PSD from 
the OP. It is preferred that a linker resembling the 
5 substrate of the target protease toe incorporated 

anywhere on the display phage because this could make 
separation of excellent binders from the rest of the 
population needlessly mors difficult. 

If there is steric hindrance of the site-specif ic 

10 cleavage of the linker, the linker may be modified so 
that the cleavage site is more exposed, e.g. f by 
interposing glycines (for additional flexibility) or 
prolines (for waxiaium elongation) between the cleavage 
site and the bulk of the protein, improved 

15 thrombin cleavage of a GST fueion protein by introducing 
a glycine- rich linker (FGXS&GGGG) immediately after the 
thrombin cleavage site (hWRGS) . A suitable linker may 
also toe identified toy variegation-end- selection. 

The sequences of regulatory parts of the gene are 

20 taken from the sequences of natural regulatory elements: 
a) promoters, to) Shine- Daigarno sequences, and c) trans- 
criptional terminators. Regulatory elements could also 
toe designed from knowledge of consensus sequences of 
natural regulatory regions* The sequences; of these 

25 regulatory elements are connected to the coding regions; 
restriction aites are also inserted in or adjacent to the 
regulatory regions to allow convenient manipulation. 

The essential function of the affinity separation is 
to separate GPs that bear PBDs (derived from XPB2» having 

30 high affinity for the target from GPs bearing mm having 
low affinity for the target. If the eiution volume of a 
phage depends on the number of PBDs on the phage surface, 
then a phage bearing xnany FBDs with low affinity, 
GPCPBD*.}, might c©~elute with a phage hearing fewer FBDs 

33 with high affinity, aP{PR%K Regulation of the display 
gene preferably is such that most phage display 
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sufficient PBP to effect a good separation according to 
affinity. Use of a regulatable promoter to control the 
level of egression of the display gene allows fine 
adjustment, of the cimaa&tographic behavior of the 
s %<-ariegated population. 

induction of s>^thesie of engineered genes in 
vegetative bacterial cells has been exercised through the 
use of regulated promoters such as lieISi, ferpP, £&S 
CMSNX82} . The factors that regulate the quantity of 

10 protein synthesized are sufficiently *ell understood that 
a wide variety of heterologous proteins can now ha 
produced in 1^ gpJJU &*. a&yjLi^ and other host cells in 
at least moderate quantities (SKB&88 f BBTT38) - 
Preferably, the promoter for the display gene is subject 

IS to regulation by a email chemical inducer. For example, 
the Ms promoter and the hybrid j^p-lms (£§cj promoter 
are regulatable with isopropyi thiog&Xaetoside IX2?TG) . 
The promoter for the constructed gene need not coma from 
a natural gg& gene? any regulatable promoter functional 

20 in bacteria can be used. A non-leafcy promoter is 
preferred. 

The coding portions of genes to be synthesized are 
designed at the protein level and then encoded in 
While the primary consideration in devising the Dim 

2S sequence is obtaining the desired diverse population of 
potential binding domains ^or epitopes} , consideration is 
also given to providing restriction sites to facilitate 
further gene manipulation, minimising the potential for 
recombination and spontaneous mutation, and achieving 

30 efficient translation in the chosen host cells, 

The present invention is not limited to any parti- 
cular method of »ft synthesis or construction. Conven- 
tional m& synthesizers may be used,- with appropriate 
reagent modifications for production of variegated Dm 

35 (similar to that now used for production of mixed 
probes) „ 
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The phage are genetically engineered and then 

transfected into host cells, e.g.* IL™£QlJL„JL 8:uMtiXlt^ 

or P. aeruginosa, suitable for amplification. Th® 
present invention is not limited to any one method of 
5 transforming cells with BH& or to any particular host 
eelis. 

THE HrXTIXL P01?E!STXM> B1MDH?G P0M&XSI s 

By virtue of the present invention, proteins my be 
obtained which can hind specifically to targets other 

10 than the antigen- combining sites of antibodies. For the 
purposes of the appended claims, a protein P is a bjLMi&g 
protein if for some target other than the variable domain 
of an antibody, the dissociation constant §E S (P f M < JW* 
moles/liter (preferably, * 1Q' 7 moles/liter); The 

XS exclusion of "variable dcsaaia of an antibody* is intended 
t© snake clear that for the purposes herein a protein is 
not to be considered a "binding protein" merely because 
it is antigenic- However, an antigen my nonetheless 
qualify as a binding protein because it specifically 

20 binds to a snbstan.ce other than an antibody, e^g^., an 
smsyme for its substrate, or a hormone for its cellular 
receptor, Additionally, it should be pointed out that 
"binding protein 8 may include a protein which binds 
specifically to the Fo of an antibody, g lt ,g,,„, 

25 staphylococcal protein Ju 

While the present invention amy he used to develop 
novel antibodies through variegation of codone 
corresponding to the hfpervariable region of an 
antibody's variable domain, its primary utility resides 

30 in the developsent of binding proteins which are not 
antibodies or even variable domains of antibodies. Hovel 
antibodies can he obtained by immunological techniques; 
novel enzymes, hormones, sOL, cannot, 

Most larger proteins fold into distinguishable 

35 globules called domains. m& display strategy is first 
perfected by modifying a genetic pbage to display a 
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stable, structured domain Ctne ^Md£ML,sa&^tktMl.. himim 
for which an affinity molecule (which my 
be an. antibody! is Obtainable. The success of the 
jaodificatioss is readily sseasurea by, e, dstersdnios 
5 w&efefcer the modified genetic phage binds to the affinity 
sjolecule. For the purpose of identifying XPBBs, 
definitions of "domain" which emphasis© stability -* 
retention of the overall structure in the face of 
perturbing forces such as elevated temperatures or 

10 chao tropic agents — are favored, though atomic coor- 
dinates and protein sequence homology are not completely 
ignored. Wxm a domain of a protein is primarily 
responsible for the protein's ability to specifically 
mm a chosen target, it is referred to herein as a 

15 "binding domain 8 (BP) , 

The IPSO is chosen with a view to its tolerance for 
extensive mutagenesis. Once it. is known that the XPHD 
can be displayed on a surface of a phage and subjected to 
affinity selection, the gene encoding the XPSD is sub- 

20 j acted to a special pattern of multiple mutagenesis, here 
termed "variegation* , which after appropriate cloning and 
amplification steps leads to the production of a popula- 
tion of phage each of vnich displays a single potential 
binding domain Ca mutant of the I£BD) , but which 

as collectively display a multitude of different though 
structurally related potential binding doaiains {HBDs) * 
Bach genetic phage carries the version of the pM gene 
that encodes the PSD displayed on the surface of that 
particular phage. Affinity selection is then used to 

30 identify the display phage bearing the PBDs with the 
desired binding characteristics., and these display phage 
jaay then be amplified. After one or more cycles of 
enrichment by affinity selection and amplification, the 
mm. encoding the successful binding dossalus (SBbs) may 

35 then be recovered from selected phage. 
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If need toe, the Ml from the SBD-bearing phage asay 
then is® further •variagated":, using an SSD of the last 
round of variegation as the "parental potential binding 
domain* (PPBDJ to the nesct generation of PBDb, and the 
5 process continued until the worker in the art is 
satisfied with the result. At that point, the SSD may be 
produced by any conventional means, including chemical 
synthesis . 

The initial potential binding domain may has 1) a 

10 domain of a naturally occurring protein, 2} a non "-natur- 
ally occurring domain which substantially corresponds; in 
sequence to a naturally occurring domain, but which 
differs from it in sequence by one or more substitutions, 
insertions or deletions, 3} a domain substantially 

IS corresponding in sequence to a hybrid of subsequences of 
two or snore naturally occurring proteins? or 4) an 
artificial domain designed entirely on theoretical 
grounds based on knowledge of amino acid geometries and 
statistical evidence of secondary structure preferences 

20 of amino acids. (However, the limitations of & w&qxX 
protein design prompted the present invention.) Usually, 
the domain will be a known binding dosaain, or at least a 
homologue thereof, but it a»y toe derived from a protein 
which, while not possessing a known binding activity, 

25 possesses a secondary or higher structure that lends 
itself to binding activity {clefts* grooves, . The 

protein to which the XPBD is related need not have any 
specific affinity for the target material. 

In determining whether sequences should fee deemed to 

30 « substantially correspond* , one should consider the 
following issues: the degree of sequence similarity when 
the sequences are aligned for best fit according to 
standard algorithms, the similarity in the connectivity 
patterns of any crosslinks {e.g. , disulfide bonds) » the 

35 degree to which the proteins have similar three- dimen- 
sional structures, as indicated by, e.g. , X-ray diffrac- 
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tion analysis or BMR, and the degree to which the se- 
quenced proteins toe similar biological activity, la 
this context, it should be noted that among the serine 
protease inhibitors, there are families of proteins 
5 recognised to be homologous in which there are pairs of 
mesabers with as little as 30% sequence homology. 

& candidate XPBD should meet the following criteria ; 
X) a domain exists that will remain stable under the 
conditions of its intended use (the domain my 
10 comprise the entire protein that will be inserted, 

e,«. BPTI, of~conotoxln ®X t or <MTX«XXX> , 
2} knowledge of the amino acid sequence is obtain- 
able, and 

3) a molecule is obtainable having specific and high 
IS: affinity for the XPBP, MM£1PBD| . 

Preferably, in order to guide the variegation strategy, 
knowledge of the identity of the residues on the domain's 
outer surface, and their spatial relationships, is 
obtainable; however, this consideration is less important 
20 if the binding domain is small, &„,%,,,., under 4.0 
preferably, the XPBb is no larger than 
because small SBDs (for example, less than 40 
acids) can be chemically synthesized and 
1 easier to arrange restriction sites in smaller i 

sailer than about 40 residues, an 
that the entire variegated pM gene 
can be synthesised in one piece, Xn that case, we need 
arrange only suitable restriction sites in the sm 
A smaller protein minimises the metabolic strain on the 
30 phage or the host of the GP. The 1PBD is preferably 
seller than about ms residues. The XPSD must also be 
large enough to have acceptable binding affinity and 
specificity. For m XWD lacking covaleut crosslinks, 
such as disulfide bonds, the IPSO is preferably at least 
3S 40 residues; it may be as small as six residues if it 
contains a crosslink. 
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There are many candidate XPBUs , for esssaq&le, bovine 
pancreatic trypsin inhibitor SS residues} t CffTI- 

III (29 residues } , craato&a C4S residues?), third domain of 
ovomucoid. (56 resMaeiS, ; heat- stable enterotoxin (ST~Xa 
S of B. colli ae residues), a-Canotoxi» GI (33 residues), 
M-Conotoxin (33: XI (22 residues} ,. Conns King Kong mini- 
protein (2? residues), T4 lysozyme (1S4 residues.) , and 
azurin (128 residues) . Table SO lists several preferred 
XPBDs . 

10 In soma cases,, a protein having some affinity for 

the target, may be a preferred I3?BD even though sows other 
criteria are not optimally met. For example, the VI 
domain of CD 4 is a good choice as XPBD for a protein that 
binds to gpX2Q of hxv. It is known that mutations in the 

IS region 42 to S3 of VI greatly affect gpI20 binding and 
that other mutations either have much leas effect or 
completely disrupt the structure of ¥1, Similarly, tumor 
necrosis factor CXW) would foe a good initial choice if 
one wants a TNF-like molecule having higher affinity for 

20 the THF receptor, 

As even surface mutations may reduce the stability 
of the PSD, the chosen XPBD should have a high melting 
temperature (SO'C acceptable, the higher the better; BPTI 
melts at 95 *C* and foe stable over a wide pH range (8.0 

2S to 3,0 acceptable ; li.o to 2.0 preferred), so that the 
SBDs derived from the chosen XPBD toy mutation and 
selection- through -binding will retain sufficient stabil- 
ity. Preferably, the substitutions in the XPBD yielding 
the various PSDs do not reduce the melting point of the 

30 domain below «40*€, Mutations may arise that increase the 
stability of SBDs relative to the XPBD , but the process 
of the present invention does not depend upon this 
occurring. Proteins containing eovalent crosslinks, such 
as multiple disulfides* are usually sufficiently stable. 

35 A protein having at least two disulfides and having at 
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least I ' disulfide, for m®t& twenty residues may be 
presumed to be sufficiently stable. 

If the target is * protein or other maeromolecnle a 
preferred e^bodimesafc of the TSW is a small protein sueh 
5 as the Cucurbits. siaxiraa trypsin inhibitor III (29 resi- 
dues) , BPTI from « - (SB residue©) , erambin from 
rape seed |4S residues) , or the third domain of ovomucoid 
from S&fcu.csi.x coturnlx ffaponlca (Japanese quail) I5S 
residues) , .because targets from this class have clefts 

10 and grooves that can accomodate sail proteins in highly 
specific ways. If the target is a macromol scale lacking 
a. compact structure, such as starch, it should be treated 
as if it ware a small molecule. Extended m&eromoieeules 
with defined 3D structure, such as collagen, should be 

IS treated as large molecules,. 

If the target is a small molecule, such as a 
steroid, a preferred embodiment of the IPBB is a protein 
of about 80-200 residues, such as rihonnclease from $£& 
tanrus (124 residues) , ribonuclease from hMStSSSlllm 

20 erus ae (104 residues), hen egg white lysozyms from. £sllu& 
SiULUS (129 residues) , asuzin f rom Mt^taSPJMS. a&sm&BQ&k 
(128 residues) , or T4 lysosyme (164 residues) , because 
such proteins have clefts and grooves into which the 
era.ll target molecules can fit. The BrooSdiaven Protein 

as Data Bank contains 3D structures for all of the proteins 
listed. Genes encoding proteins as large as T4 lysosyme 
can be stipulated by standard techniques for the 
purposes of this invention. 

If the target is a mineral, insoluble in water, one 

30 considers the nature of the molecular surface of the 
mineral. Minerals that have smooth surfaces, such as 
crystalline silicon, are best addressed with medium to 
large proteins, such as rihonnclease, as IPBD in. order to 
have sufficient contact area and specificity. Minerals 

35 with rough, grooved surfaces, such as zeolites, could he 



WO 92/15679 



31 

bound either by small proteins, such as BFTX, or larger 
proteins, such a© T4 Ijmozjm. 

BFTX is an especially preferred 1PB0 because it 
meets or exceeds all the criteria s it is a small, very 
3 stable protein with a wall known 3D structure. 

Small polypeptides have potential advantages over 
larger polypeptides when used as therapeutic or 
diagnostic agents, including (but not limited to) : 
a) better penetration into tissues, 
10 b) faster elimination from the circulation I important 
for imaging agents) , 
o) lower antigenicity,' and 
d) higher activity per mass. 

Thus, it would be desirable to be able to employ the 
IS combination of variegation and affinity selection to 
identify email polypeptides which Mad a target of 
choice. 

Polypeptides of this 'si*®, however, have disadvan- 
tages as binding molecules. According to Olivera jsfc 

28 (OLIVSOa) : "Peptides in this sisa range normally equi- 
librate among many conformations {in order to have a 
fixed conformation, proteins generally have to be much 
larger)." Specific binding of a peptide to a target 
molecule requires- the peptide to tajke up one conformation 

25 that is complementary to the bidding site. 

In one embodiment, the present invention overcomes 
these problems, while retaining the advantages of smaller 
polypeptides, by fostering the biosynthesis of novel 
mini.~protei.ns having the desired binding characteristics . 

30 Mini - Proteins are small polypeptides (usually, lees than 
about 60 residues ? more preferably lees than 40 residues 
{ ^micro-proteins") ) which, while too small to have a 
stable conformation as a result of noncovalent forces 
alone, are covalently crosslinked (e^o^,., by disulfide 

35 bonds) into a stable conformation and hence have 
biological activities acre typical of larger protein 
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molecules than of nneons trailed pe&ypqpSiQ&B of 
comparable size. 

When Bin! -proteins are -variegated, the residues 
which ar® eovalenfcly cr©sssli»ked in the parental molecule 
5 are left unchanged, thereby stabilizing- the conformation. 
For example, in the variegation of a disulfide bonded 
mini -protein, certain cysteines are invariant so that 
under the conditions of expression and display, eovalent 
crosslinks {&*£U» disulfide bonds between one or more 

10 pairs of cysteines } fom, and substantially constrain the 
conformation which my he adopted by the hypervariable 
linearly intermediate amino acids. la other words,, a 
constraining scaffolding is engineered into polypeptides 
which are otherwise extensively randomised, 

15 once a mini-protein of desired binding chaxmcter- 

istios is characterised, it may be produced, not only by 
recombinant Sffi techniques, but also by nosological 
synthetic methods. 

For the purpose of the appended claims, a mini- 

20 protein has between about eight and about 60 residues » 
m intrachain disulfide bridge connecting amino acids 3 
and 8 of a 16 residue polypeptide will be said herein to 
have a span of 4. If amino acids 4 and 12 are also 
disulfide bonded, then their bridge has a span of 7< 

25 Together, the four cysteines divide the polypeptide into 
four intercysteine segments (1-2, 5-7, 9-11, and 13-16) . 
(Note that there is no segment between Cys3 and Cys4,} 

The connectivity pattern of a crossl inked mini- 
protein is a simple description of the relative location 

30 of the termini of the crosslinks. For example, for a 
mini -protein with two disulfide bonds, the connectivity 
pattern 15 1-3, 2-4* meads that the first crossl inked 
cysteine is disulfide bonded to the third orosslinked 
cysteine {in the primary sequence) , and the second to the 

3S fourth. 
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The variegated di sulfide- bonded mini -proteins of the 
prese&fc invention, fall into several classes . 

£&ftS.g I,„M^i-p.mfc^i^ are those featuring a single 

pair of cysteines capable of interacting to form a 
5 disulfide bond, said bond having a span of no more than 
nine residues. This disulfide bridge preferably has a 
span of at least two residues; this is a function of the 
geometry of the disulfide bond, Mien the spacing is two 
or three residues, one residue is preferably glycine in 

10 order to reduce the strain on the bridged residues. Thfc 
upper limit on spacing is less precise., however, in 
general, the greater the spacing, the less the constraint 
on conformation imposed on the linearly intermediate 
amino acid residues by the disulfide hoxid, 

IS A disulfide bridge with a span of 4 or S is espe- 

cially preferred. If the span is increased to the 
constraining influence is reduced. In this case, we 
prefer that at least one of the enclosed residues tee an 
amino acid that imposes restrictions on the main- chain 

20 geometry, Proline imposes the most restriction, Valine 
and isoleucine restrict the main chain to a lesser 
extent. The preferred position for this constraining 
non~ cysteine residue is adjacent to one of the invariant 
cysteines, however, it may foe one of the other bridged 

25 residues. If the span is seven, we prefer to include two 
amino acids that limit main-- chain conformation. These 
amino acids could be at any of the seven positions, but 
are preferably the two bridged residues that axe 
immediately adjacent to the cysteines. If the span is 

30 eight or nine, additional constraining amino acids may be 
.M.dsd, 

Additional amino acids may appear on the amino side 
of the first cysteine or the carfooxy side of the second 
cysteine. Only the immediately proximate ''unspanned" 
35 amino aside are likely to have a significant effect, on 
the conformation of the span. 
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n nri, ^ .protein* are those- featuring a single 
disulfide bond having a span of greater than nine amino 
acids. The bridged amino acids form secondary structures 
which help to stabilise their conformation, Preferably, 
intermediate amino acids form hairpin 



-CFS-«feeli^-tisxn~B»strand-C> f s- 

j — — —j 
~Cj^-cvheldx»turn-«helix»€ys- 

5 ,_ s _g_ _ — , 

.-Cys~ 

m designing a suitable hairpin structure, c 
copy an actual structure from a protein whose 
dimensional conformation is known, design the structure 
vtsing secondary structure tendency data for the 
individual amino acids, etc., or combine the two 
approaches. Preferably, one or more actual structures 
are used as a model, and the frequency data is need to 
determine which mutations can be made without disrupting 
the structure. 

Preferably, no more than three amino acids lie 
betweeu the cysteine and the beginning or end of the & 
helix or S strand. 



flia^SJaStOSaiM are those featuring a 
plurality of disulfide bonds. They optionally may also 
feature secondary structures such as those discussed 
abow with regard to Class XI mini-proteins. Since the 
aumfoer of possible disulfide bond topologies increases 
rapidly with the number of bonds (two bonds, three 



topologies; three bonds, 15 topologies ? four bonds, 105 
topologies: the number of disulfide bonds preferably does 
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mot exceed- four.. T*$- .-disulfide bond ara preferable to 
three, and three to four. With two or more disulfide 
bonds , the disulfide bridge spans preferably do not 
exceed 30, and the largest intercysteine chain segment 
S pref er&foly does not exceed SO. 

naturally occurring class XIX mini -proteins, such as 
heat-stable antarotoxin S^-la, frequently have pairs Of 
cysteines that are clustered (-C-C- or -GSK-C-) in the 
amino- acid sequence, Clustering reduces the number of 
10 realisable topologies,, and amy be -advantageous « 

the present invention are not limited to those 
crosslinked by disulfide bonds, Another important class 
at mini-proteins are analogues of finger proteins, 

is. Finger proteins are characterized by finger structures in 
which a metal ion is coordinated by two Cys and two His 
residues, forming a tetrahedral arrajjgentent around it. 
The metal ion is most often zinc* XX) , hut may be iron, 
copper,, cobalt , etc . The * finger* has the consensus 

20 sequence (Phe or Tyr} - (X &&} -Cys- (2-4 AAs)-Cys~{3 Ms) - 
Phe- iB ma) -Leu- (2 ms) -His- B Ms) -His- 15 Afts) CSHRSaS; 
GXBS885 , The present invention encompasses mini -proteins 
With either ons or two fingers. 

Further diversity may fee introduced into a display phage 
25 library ofpotential binding domains by treating the phage 
with (preferably »<mfc<wd«c} enaayraes and/or chemical 
reagents that can selectively modify certain side groups 
of proteins, and thereby affect the binding properties of 
the displayed FBSs. Using affinity separation methods, 
30 we enrich for the modified. SPs that bind the 
predetermined target. Since the active binding domain is 
not entirely genetically specified, we must repeat the 
post -morphogenesis modification at each enrichment round. 
This approach is particularly appropriate with mini- 
35 protein XFBUs feseamse we envision chemical synthesis of 
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The present invention also relates to the 
identification of epitopie peptides which bind to a 
target which ie the epitopic binding site of an antibody, 
lectin, eMps, or other binding protein. In the case 
of an antibody, the epitopic peptide will be at least 
four amino acids and more preferably at least six or 
eight amino acids. Usually, it will be less than 20 
amino acids, tout there is no fixed upper limit. in 
general., however, the epitopic peptide will be a "linear* 
or "sequential « epitope. Typically, in constructing a 
library for displaying epitopic peptides, all or most of 
the amino acid positions of the potential epitope will he 
varied. However, it is desirable that among those amino 
acids allowed at a particular position., that there toe a 
relatively equal representation, as further discussed 
below in the context of mutagenesis of protein domains. 



TO 

wxsmzm jxjk&xhs cor epitopes) with desxiosd uxvsrsitt 

When the number of different amino acid sequences 
obtainable by mutation of the domain is large when 
compared to the number of different domains which are 
displayable in detectable amounts, the efficiency of the 
forced evolution is greatly enhanced by careful choice of 
which residues era to be varied. First, residues of a 
kuown protein which are likely to affect its binding 
activity (e.g.., surface residues) and not likely to 
unduly degrade its stability are identified- Than all or 
some of the eodons encoding these residues are varied 
simultaneously to produce a variegated population of DEL 
The variegated population of DRR is used to express a 
variety 'of potential binding domains, whose ability to 
bind the target of interest may then be evaluated. 

The method of the present invention is thus further 
distinguished from other methods in the nature of the 
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highly variegated population that is produced and from 
which novel binding proteins are selected. We force the 
displayed potential binding domain to saisple the nearby 
* sequence space* of related amino- acid sequences in an 
5 efficient,, organised muasr. Four goals guide the 
various variegation plane need herein, preferably 5 1} a 
vary large number fe.g, W 1 } of variants is available,, 2} 
a very feign percentage of the possible variants actually 
appears in detectable amounts, 3} the frequency of 

10 appearance of the desired variants is relatively uniform, 
and 4) variation occurs only at a limited number of 
amlnc-acid residues, most preferably at residues having 
side groups directed toward a common region on the 
surface of the potential binding domain, 

IS This is to ha distinguished from the simple use of 

indiscriminate mutagenic agents such as radiation and 
hydroKylamine to modify a gene, where there is no for 
very oblique) control over the site of mutation, Many of 
the mutations will affect residues that are not a part of 

20 the binding domain. Moreover, since at a reasonable 
level of mutagenesis, any modified codon is likely to be 
characterised by a single base change., only a limited and 
biased range of possibilities will be explored. Equally 
remote is the use of site -specific mutagenesis techniques 

25 employing mutagenic oligonucleotides of nonrandomized 
sequence., since these techniques do not lend themselves 
to the production and testing of a large number of 
variants, while focused random mutagenesis techniques 
are known,, the importance of controlling the distribution 

30 of variation has been largely overlooked. 

The term "variegated Ml 8 (ygpm) refers to a 
mixture of Dm molecules of the same or similar length 
which, when aligned, vary at some eodons so as to encode 
at each such codon a plurality of different amino acids, 

35 but which encode only a single amino acid at other codon 
positions. It is further understood that in variegated 
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the codoas which are variable, and the range and 
frequency of occurrence of the different amino acids 
which a given variable codon encodes, are determined in 
advance by the synthesizer of the even though the 

5 synthetic method does not allow one to know, a priori, 
the sequence of any individual DHA molecule in the 
mixture. The number of designated variable codecs in the 
variegated DMA is preferably no more than 20 codon®, and 
more preferably no more than S~iO codoasu The mix of 

10 axaino acids encoded at each, variable codon asay differ 
from codon to cocoa, A population of display phage into 
which variegated BH& has been introduced is likewise said 
to foe * variegated" - 

Mien DN& encoding a portion of a known domain of a 

IS protein is variegated, the original domain is called the 
parent of the potential binding domains (PPBD) , and the 
multitude of mutant dossains encoded as a result of the 
variegation are collectively called the "potential 
binding domains" {FSB} , as their ability to bind to the 

20 predetermined target is not then Joiowau 

We now consider the manner in which we generate a 
diverse population of potential binding domains in order 
to facilitate selection of a PBD -bearing phage which 
binds with the requisite affinity to the target of 

25 choice. The potential binding domains are first designed 
at the amino acid level. Once we have identified which 
residues are to be mutagenized, and which mutations to 
allow at thorn positions, we may then design the 
variegated D». which is to encode the various 3?BDs so as 

30 to assure that there is a reasonable probability that if 
a PBD has an affinity for the target, it will be 
detected. Of course, the number of independent 
trans ferments obtained and the sensitivity of the 
affinity separation technology will impose limits on the 

35 extent of variegation possible within any single round of 
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There are many ways to generate diversity in a 
protein, At ox?® extr«iffi f we vary a few residues of the 
protein as roach as possible C "Focused Mutagenesis" ) , 
e.g., *e pick a eat of five to seven residues and vary 
each through 13*20 possibilities An alternative plan of 
mutagenesis ("Diffuse Mutagenesis* } is to vary asany more 
more limited set of choices (See 
6) . The variegation, pattern adopted may 

fall 

10 there is no fixed limit on the number of codone 

which can .be mutated simultaneously. However, it is 
desirable to adopt a mutagenesis strategy which results 
in a reasonable probability that a possible PBD sequence 
is in fact displayed by at least one phage, Preferably, 

15 the probability that a mutein encoded by the ygBM and 
composed of the least favored amino acids at each 
variegated position will be displayed by at least one 
independent transforms^ in the library is at least 0*50, 
and more preferably at least 0.90. (Muteins composed of 

20 more favored amino acids would of course be more likely 
to occur in the same library,} 

Preferably, the variegation is such as will cause a 
typical transformant population to display 10 s - ID 7 
different amino acid sequences by means of preferably not 

25 more than 10- fold more (more preferably not more than 3- 
fold) different DMA sequences. 

For a mini-protein that lacks at helices and £ 
strands, one will, in any given round of mutation, 
preferably variegate each of 4-6 non- cysteine cocoas so 

30 that they each encode at least eight of the 20 possible 
amino acids. S&e variegation at each codon could be 
customised to that position. Preferably, cysteine is not 
one of the potential substitutions, though it is not 



When the mini -protein is a metal finger protein, in 
a typical variegation strategy, the two Cys and two His 
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residues , and optionally also the aforementioned Phe/Tyr, 
Bhe mxuS. Leu residues, are held invariant and a plurality 
(usually 5-10} of fcise other residues are varied. 

When the mini-protein is of the type featuring one 
5 or more « helices and B strands, the set of potential 
amino acid modif icatians at any given position is picked 
to favor those which are lees likely to disrupt the 
secondary structure at that position. Since the number 
of possibilities at each variable amino acid is mors 

10 limited, the total nimsfeer of variable amino acids amy toe 
greater without altering the sampling efficiency of the 
selection process. 

For the last -mentioned class of mini- proteins, as 
well as domains other than mini -proteins, preferably not 

IS more than 20 and more preferably S~10 codons will fee 
variegated. However, if diffuse mutagenesis is employed, 
the number of codons which are variegated can be higher . 

The decision as to which residues to modify is eased 
by knowledge of which, residues lie on the surface of the 

20 domain and which are Juried in the interior. 

We choose residues in the FFBD to vary throng 
consideration of several factors, including x a) the 3D 
structure of the PPSD, to) sequence© homologous to PPBD, 
and c) modeling of the PPBD and smtants of the Pmt>. 

25 Mien the number of residues that could strongly influence 
binding without preventing the normal folding of the B» 
is greater than the number that should be varied 
simultaneously, the user should pick a subset of those 
residues to vary at one time. The user picks trial 

30 levels of variegation and: calculate the abundances of 
various sequences. The list of varied residues and the 
level of variegation at each varied residue are adjusted 
until the composite variegation is eommensiirate with the 
sensitivity of the affinity separation and the number of 

35 independent transformants that can be made. 
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Having picked which residues to vary, we now decide 
the range of amino acids to allow at each variable 
residue. The total level of variegation is the product 
of the number of variants at each varied residue, Bach 
S varied residue can have a different scheme of 
variegation, producing 2 to 20 different possibilities. 
The set of amino acids which are potentially encoded by 
a given variegated codou are celled its ^substitution 
set", 

10 The computer that controls a BN& synthesiser, such 

as the Milligen 7500, can foe programed to synthesis© any 
base of an oligo-nt with any distribution of nts by 
taking some at substrates * e-d« at phoepfcoramidites) from 
each ©f two or more reservoirs. Alternatively, nt 

15 substrates can be mixed in any ratios and placed in one 
of the extra reservoir for so called "dirty bottle* 
synthesis . Bach codon could be programmed differently. 
The 8 Mx s of bases at each nucleotide position of the 
codon determines the relative frequency of occurrence of 

20 the different amino acids encoded by that codon. 

Simply variegated codoas are those in which those 
nucleotide positions which are degenerate are obtained 
from a mixture of two or more bases mixed in aquimolar 
proportions . These mixtures are described in this 

2S specification by means of the standardised "ambiguous 
nucleotide* code. In this code, for example, in the 
degenerate codon *S2fr* f *S* denotes an aguimolar mixture 
of bases G and C, *$*, an equimolar mixture of ail four 
bases, acd *T* # the single invariant base thymidine „ 

30 Complexly variegated codons are those in which at 

least one of the three positions is filled by a base from 
an other than equimoiar mixture of two of more bases . 

Either simply or complexly variegated codons may be 
used to achieve the desired substitution set, 

35 if we nave no information indicating that a parti- 

cular amino acid or class of amino acid is appropriate, 
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we strive to substitute all amino acids with equal 
probability because representation of one mini -protein 
above the detectable is wasteful. Equal amounts of 

all four nta at each position, in a cote yields the 

amino acid distribution in which ©acfc amino acid is 
present in proportion to the number of codoss that code 
for it. This distribution has the disadvantage of giving 
two basic residues for awy acidic residue. In 
addition, six times ae much R, and h as «r or M occur. 
X£ five codone are synthesized with this distribution, 
each of the 243 sequences encoding some eombioatian of L, 
a, and S are 7776 -times more abundant than each of the 32 
sequences encoding some combination of w and M. To hav& 
five ws present at detectable levels, we must have each 
of the eeguenoes present in 7776 -fold excess. 

Xt is generally accepted that the sequence of amino 
acids in a protein or polypeptide determine the three- 
dimensional structure of the molecule, including the 
possibility of no definite structure. Among polypeptides 
of definite length and sequence, some have a defined 
tertiary structure and most do not. 

Particular amino acid residues can influence the 
tertiary structure, of a defined polypeptide in several 
ways, including by; 

a) affecting the flexibility of the polypeptide main 



fa) adding hydrophobic groups, 

c) adding charged groups, 

d) allowing hydrogen bonds, and 

30 e) forming cross -links, such as disulfides, chelation 
to metal ions, or bonding to prosthetic groups. 

SLY is the smallest amino acid, having two hydrogens 
attached to the q.. Because GLY has no C s , it confers the 
35 most flexibility on the main chain. Thus GhY occurs very 
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frequently in reverse: taiSfcs, particularly in conjunction 
with PRO, ASP, ASK, SEE., and THE, 

The ami.no acids MA, SEE, CYS, ASP, MW t hW3 t MET? 
FEE, TYE, TEP, AES, HIS, GISJ, GhH, and LYS have 
S unbraoehed & carbons. Of these, the side groups of SBR, 
ASP, and ASH frequently sake hydrogen bonds to the main 
chain and so can take cm. main- chain conformations that 
are energetically unfavorable for the others, ILB, 
and THE have branched & carbons which mates the extended 

10 main- chain conformation more favorable. Thus VAX. and ILB 
are most often seen in & sheets , Because the side group 
of THE can easily form hydrogen bonds to the main chain* 
it has less tendency to exist in a S sheet , 

The main chain of proline is particularly 

15 constrained by the cyclic side group. The # angle is 
always close to -SO*, Most prolines are found near the 
surface of the protein. 
Charge 

LYS and ARG carry a single positive charge at any pH 

20 below 10,4 or 12,0, respectively, Nevertheless, the 
methylene groups, four and three respectively, of these 
amino acids are capable of hydrophobic interactions. The 
guanidinium group of AEG is capable of donating five 
hydrogens simultaneously, while the atd.no group of LYS 

2S can donate only three. Furthermore, the . geometries of 
these groups is quite different, so that these groups are 
often not interchangeable. 

ASP and <3hU carry a single negative charge at any pB 
above **4,S and 4,6, respectively. Because ASS* has but 

30 one methylene group, few hydrophobic interactions are 
possible. The geometry of AS I? lends itself to forming 
hydrogen bonds to main- chain nitrogens which is 
consistent with ASP being found very often in reverse 
turns and at the beginning of helices, G2/J is more often 

3S found in a helices and particularly in the amino- terminal 
portion of these helices because the negative charge of 
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tjie side group has a stabilising interaction with th« 
helix dipole {MICKS 8, S&bX88) . 

HIS has an ionization pK in the physiological range,, 
via. 6.2. This $K oan toe altered by the proximity of 
charged groups or of hydrogen donators or acceptors . HIS 
is capable of forming bonds to metal ions such as sine 
copper, and iron. 

Aside from the charged amino acids, SBR, TBR, ASH, 
am, TfR, and TE? can participate in hydrogen bonds. 



: important form of cross link is the disul- 
fide bond formed between two thiols, especially the 
thiols of CYS residues. In a suitably oxidizing environ- 
ment, these .bonds form spontaneously. These bonds can 
greatly stabilise a particular conformation of a protein 
or mini -protein. When a mixture of oxidized and reduced 
thiol reagents are present, exchange reactions take place 
that allow the most stable conformation to predominate, 
Concerning disulfides in proteins and peptides, see also 
mi!Z9Q f ¥JkT8B3 f PEER84 f P1RR8S, S.MJE86, WELLS 6, JJa*&89 .< 
HOEVBS , KXSH8S, and SCBHB6, 

Other cross links that form without need of specific 
susymes include; 

1! CCT3) 4? Pe Euhredoxin (in CREX84, P.376) 

2} ms) 4 i%rx Aspartate Transearbamylase (in 

Cmtm, P.37S3 and In-flngers 

3 ) ( H I B } 3 { M E T ) { C Y S ) % C u 
Aznrin (in CRBX84, P. 376} and Basic "Slue" Cu 
Cuonmber protein (GtlSSas) 
4} (HIS)*jOi Cam superoxide dismutase 

5) (CXS.)*s CF%S 4 ) ferredoxte tm CREX84, P. 3?€) 

6) (Cf$} t WZB)$tm Sine-fingers (GXBSiS) 

7} (CyS}*\HlS) :%a Sine-fingers (GMJSS7, SXB888) 
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Cross links having fHIS) s »T) (mm :Cu has the potential 
advantage that HIS aad MET can not form other cross links 
without Cu. 

aimply Variegated Codoas 

5 The following sinply- variegated eodons are useful 

because they encode a relatively balanced set of amino 
acids t 

1} sm which encodes the set Ch,3?,K,.R,V,A,»,G3' j a) on© 
acidic CP) and one basic CR) , b) both aliphatic 
10 {L, V} and aromatic hydrophobics (H) » c) large 

Cli f E f H) and small (G,h) side groups, d> ridged CP) 
and flexible {&} amino acids, a) each amino acid 



RHG which encodes the set flK>T,K,R,V,A,B,GJ ,s a) one 
acidic and two basic (not optimal, but acceptable) , 
h) hydrophilics and hydrophobics, c) each amino acid 



set (T,K f A,S| : a) one acidic, 
one basic, one neutral hy&rophiiic, h) three favor 
20 a helices, c) each amino acid encoded once, 

4} WT which encodes the set th, P,H,R, x,T,N,8,v,A,D, Gl t 
a) one acidic, one basic, b) all classes ? charged, 
neutral hydrophilic, hydrophobic, ridged and flex- 
ible, etc, , q) each amino acid encoded once, 

the set W*St&*& t P t %®*} ; a) two 
b) two .neutral hydrophilics, o) 
only glycine encoded twice. 

HIT which encodes t h e s e t 
CP, S , Y,. C., L, P, H, R, X,T,N, V, &, D,G) J a) sixteen DMA 
sequences provide fifteen different assioo acids? 
only serine is repeated, all others are present in 
equal amomnts (This allows very efficient sampling 
of the library. ) , b) there are equal numbers of 
acidic and basic amino acids CD. and E, once each) , 
c) all major classes of amino acids are present; 
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acidic, basic, aXipfcatic h; i ; u~bie f aromatic 
hydrophobic, and neutral hytetpfcilic, 
7} 8 88 , w h i e h e a c o d e s t h e s e t 
tL* , R*» S , W, P, Q, M, S, £, »top3 : a} fair 
S preponderance of residues that favor formation of es~ 

Helices [I*,H,A,Q, K,B? a»d, to a lesser extent, 
ByRtTls b> encode 13 different amino acids. {VHSS 
encodes a subset of the eat encoded toy W® which 
encodes 9 amino acids in nine different DMA 
10 sequences, with equal acids and bases, and 5/9 being 

•« helix- favoring.) 

variegation, smt is preferred, in 
when the codon is encoding an aiaino 
to be incorporated into an a helix, vm is 



IS 

Below, we analyse several simple variegations as to 
the efficiency with which the libraries can be sampled. 

Libraries of random h&xapeptides encoded by (SNK)' 
have been reported (SO0T90, CWX890) . Table 130 shows the 

20 expected behavior of such libraries, NKK produces single 
COdons for PBB, m ( CYS, *m>, HIS, GXM, XHB, MET, 
XatSt -MSOf, and GLtr (« set}? two codons for each of V&t*., 
MA t PRO, THE, and G&Y {*. set) ? and three eodons for each 
of I*SU, MS, and SEE CO set} « We have separated the 

25 €4,000,000 possible sequences into 28 classes, shown in 
Table 13 OA, based on the number of amino acids from each 
of these sets* The largest class is moimm with »14.€% 
of the possible sequences, Aside from any selection, all 
the sequences in one class have the same probability of 

30 being produced. Table X3GB shows the probability that a 
given m& sequence taken from the (NNX)* library will 
encode a heseapeptide belonging to one of the defined 
classes i note that only ~6.3tr of MSK sequences belong to 
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Table 130C shows the expected numbers of sequences 
in each class for libraries containing various numbers of 
independent trans foW»t» lia,, 10*. 3-10*, X0\ 3*X0 ? , 10 s f 
3-10 1 , 10*, and 3*10*). at 10 s independent transiormants 
S (XTs) , we expect to see 5S% of the class, but only 

0,1% of the amtx&a class. The vast majority of sequences 
seen come from classes for which less than 10% of the 
class is sampled. Suppose a peptide from, for example, 
class **oaasa is isolated by fractionating the library for 

10 binding to a target. Consider how much we 'knew about 
peptides that are related to the isolated sequence, 
Because only 4% of the MQOmt class was sailed, we can 
not conclude that the amino acids from the 0 set are in 
fact, the best from the 0 set. We might have im at 

15 position 2, but MS or SEE could ha better. Even if we 
isolate a peptide of the OOSOQO class, there is a notice- 
able chance that better members of the class were not 
present in the library. 

With a library of 10' XTs., we see that several 

20 classes have been completely sampled, hut that the txctmxm 
class is only 1.1% sampled. At 7.6-10 7 XTs, we expect 
display of 50% of all amino- acid sequences, but the 
classes containing three or more amino acids of the of set 
are still poorly sampled. To achieve complete sampling 

25 of the iwm)* library requires about 3*10* XTs, 10 -fold 
larger than the largest immp library so far reported. 

Table 131 shows esq >ns for a library encoded 

by (NST} 4 {KNG) S > The expec cations of abundance are 
independent of the order of the eodons or of interspersed 

30 unvaried codons . This library encodes 0.133 times as 
many amino- acid sequences, but there are only 0.0165 
times as many sequences. Thus 5.0*10 7 XTs S0~ 
fold fewer than required for gives almost complete 

sampling of the library. The results would be slightly 

35 better for {SHD* and slightly, but not much, worse for 
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sequences for eodeaas mSK f Wft f and For 1 

we have &ssxm$& that the PBD is displayed as part of an. 
essential gene, such, as gene XXI in Ff phage f as is 
indicated by the phrase * assuming stops vanish" It is 
not in any way required that such an essential gene be 
used, if a non-essential gene is used, the analysis 
would be slightly different? sampling of mm and W3 
would be slightly less efficient. Note that <W}« gives 
3, 6- fold 

X. 7-fold .t;<t* 
? gives twice as 3 
IS tot 3. 3 -fold £SS2S2E HN 

Thus, while it is possible to 
tms* mtK or mm) to obtain at a particular position all 
twenty 

highly biased set. of 

20 can be overcome by use of complexly variegated eo&ons. 
We first will present the mixture calculated (as 
¥09 0/02809 1 to minimise the ratio of most favored 
acid to least favored amino acid when the nt distribution 
is subject to two constraints s equal abundances of 

25 acidic and basic amino acids and the least possible 
number of stop codons. We have simplified the search for 
an optimal at distribution by limiting the third base to 
T or 6 (C or a is equivdlent) » However, it should be 
noted that the present invention embraces use of 

30 complexly variegated codohs in which the third base is 
not limited to T or G (or to C or 0) . 

The optimum distribution (the «fxS« codec) is shown 
in Table 10a and yields DBA molecules encoding each type 
amino acid with the abundances shown. Note that this 

35 chemistry encodes all twenty amino acids., with acidic and 
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basic amino acids being sajuiprobafele, and the most 
Savored amicus acid (serin©) is :eneo&ed only a. 454 times 
as often as the least favored amino acid { tryptophan J , 
m* "fxS« vg codon iasproves sampling most for peptides 
5 containing several of the amino acids £F,Y,C,W,H» 
,Q,I,M,H f &\d,e] for which hhk or ms provide only one 
codon. Its sampling advantages are most pronounced when 
the library is relatively small. 

The results of searhiag only tor the complexly 

10 variegated eodon which winimises the ratio of most 
favored to least favored amino acid, without additional 
constraints, is shown in Table 10B, The changes era 
small , indicating that insisting on equality of acids and 
bases and minimising stop codona costs us little. Mao 

IS note that, without restraining the optimisation, the 
prevalence of acidic and basic amino acids cornea out 
fairly close. On the other hand, relaxing the 
restriction leaves a distribution in which the least 
favored amino acid ia only .412 times as prevalent as 

The advantages of an SSST codon are discussed else- 
where in the present application. unopt laired NNT 
provides IS amino acids encoded by only IS DM& sequences . 
It is possible to improve on mr with the complexly 

25 variegated codon shown, in Table IOC, This gives five 
amino acids (SHE, LEU, MXB f vsx*, in very nearly 

equal amounts. A further eight amino acids (SHE, TXR, 
TJM t &S® S PEG, M ( ARG, Gh¥) are present at 78% the 
abundance of SEE. THE and CYS remain at half the abun- 

30 dance of SER. When variegating m& for disulfide -bonded 
mini "proteins, it is often desirable to reduce the 
prevalence of CYS, This distribution allows 13 amino 
acids to foe seen at high level and gives no stops? the 
optimised fxS distribution allows only 11 amino acids at 

35 high prevalence. 
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The mm codon can also be optimised, mm. eqnimolar 
T,C,A,6 are used is* «b one obtains doufcle doses of LEU 
and AKG. Table 10D shows an approximately optimised 
codon. There are, under tMs variegation,, four equally 
5 most favored amino acids: hSU, &RG f MA, and SLU, Note 
that there is one acidic and one basic amino acid in this 
set. Tiiere are two equally least favored amino acids: 
TES? and MBT. The ratio of Ifaa/mfaa is 0,5253. If this 
codon is repeated six times,- peptides composed entirely 
10 of TRP and MET are 2% as 
entirely of 
this as s th< 
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{TR?/KEST}* in optimised HM1 5 



by the * dirty bottle * 
it is sometimes desirable to use only a limited 

„„„■ of mixes. One very useful mixture is called the; 

"optimised HNS mixture* in which we average the first two 
positions of the fxS mixture: T s * 0.24, C* . 0,17, % m 
0.33,. S, - 0.26, the second position is identical to the 
m first, C s » % « O.S. This distribution provides the 
amino acids *RG, SEE, &W t QhY, TO, *HR> ASK, and X*£$ at 
greater than 5% plus MA, M£>, QUS, X.£& MET, asd SR at 
greater than 4%. 

An additional complexly variegated codec is of 
25 Interest, This codon is identical to the optimised KKT 
codon at the first tw positions and has TiG:;SO;10 at 
the third position. This codon provides thirteen amino 
acids [Mi&i XU, ARG, SEE, LEO, TO, P8E, *8& Sfe 

PRO, TYE, and HIS) at more than S.S%. THR at 4,3% and 
30 CYS at 3,9% are more common than the hFAIts of hW 
(3.125%) . The remaining five amino acids are present at 
less than VI. This codon has the feature that all amino 
acids are present; sciences having more than two of the 
iow«abundance amino acids are rare. When we isolate an 
35 sbd using this codon, wa can he reasonably sure that the 
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first 3.3 amino acids were tested at each position. A 
similar codon, baaed on optimised WG ( could toe used. 

Several, of the preferred simple or complex 
•variegated, cocoas encode & set of amino acids which 
S includes cysteine. This means that some of the encoded 
binding domains will feature one or more cysteines in 
addition to the invariant disulfide -bonded cysteines. 
For example, at each sans encoded position, there is a one 
in sixteen chance of obtaining cysteine. If six codons 
10 are so varied, the fraction of domains containing 
additional cysteines is 0*3 : 3k* Odd numbers of cysteines 
can lead to complications, see Serry and Wetcei {FE&R84; „ 
On the other hand, many disulfide- containing proteins 
contain cysteines that do not form disulfides, 
IS trypsin. The possibility of unpaired cysteines can he 
dealt with in several ways* 

First, the variegated phage population can he passed 
over an immobilissed reagent that strongly hinds free 
thiols, such as Sulfoldnk {catalogue number 44895 H from 
20 Pierce Chemical Company, Rockford, Illinois, S110S) . 
Another product from Pierce is TNB~Thioi Agarose (Cata- 
logue Code 20409 HI « BioRad sells M f i-Gel 401 
( catalogue 153-4599} for this purpose. 

Second, one can use a variegation that excludes 
25 cysteines, such as; 

kht that gives p?\S, y,l,!?,b, x,t,n,v,a,»J , 

VHS that gives 

th* , F* ,B,Q,R 3 ,X,M,T* ,N,K,S,V» ,A*,S,D y <3* j , 
MSG that gives th*,S,W,.P,Q f R*,M f T,K,E,V f A,E,G,stop3 , 
30 SHT that gives £h, J?,H,R,V, A, D,G] ., 

RBS that gives ft, Ti, A, 8,63 , 

mm that gives IT,K,A,EJ, 

W that gives |h,F f H,K,l,T,N, S,Y,A,D,aj , or 
RES that gives !3?,S,£,R*B,E,©*3 . 
35 However, each of these schemes has one or more of the 
disadvantages, relative to M^ss a) fewer amino acids are 
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allowed, b) amino : acids are sot evenly proviW f c) 
acidic and basic amino acids are mot equally likely) , or 
d) stop codons occur, nonetheless, W, MHT, and W are 
almost as useful as mr. « encodes 13 different amino 
acids and one stop signal. Only two amino acids appear 
twice in the 18- fold mix, 

Thirdly, one can enrich the population for binding 
to the preselected target, and evaluate selected 
sequences seat for extra cysteines. Those that 

contain more cysteines than the cysteines provided for 
confionuational constraint say be perfectly usable, it is 
possible that a disulfide linkage other than the designed 
one will occur. This does not mean that the binding 
domain def ined by the isolated 3)ffi sequence is in any way 
unsuitable, The suitability of the isolated domains is 
best determined toy chemical and biochemical evaluation of 
chemically synthesissad peptides. 

Lastly, one can block free thiols with reagents, 
such as Ellman's reagent, lodoacetata, or methyl iodide, 
that specifically bind free thiols ana that do not react 
with disulfides, and then leave the modified phage in the 
population. It is to be understood that the blocking 
agent may alter the binding properties of the mini- 
protein? thus, one .might use a variety of blocking 
reagent in expectation that different binding domains 
will be found. The variegated population of thiol - 
blocked display phage are fractionated for binding, if 
the m& sequence of the isolated binding mini-protein 
contains an odd number of cysteines , then synthetic means 
are used to prepare mini -proteins having each possible 
linkage and in whieh the odd thiol is appropriately 
blocked, Nishiuehi (HISHS2, NJSB8S, and works cited 
therein) disclose methods of synthesizing peptides that 
contain a plurality of cysteines so that each thiol is 
protected with a different, type of blocking group- These 
groups can be selectively removed so that the disulfide 
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pairing can he controlled. We envision using such a 
schema with the alteration that one thiol either remains 
blocked, or is unblocked and then rehlocksd with a 
different reagent » 
5 Use of msf or HI variegated codecs leads to very 

efficient sampling of variegated libraries because the 
ratio of {different amiho-aeid sequences) / (different mh 
sequences) is mush closer to unity than it is for m$K or 
even the optimised vg codon (fxS) , Nevertheless, a few 

ID amino acids are omitted in each case. Both NJJT and HNG 
allow members of all important classes of amino acids ; 
hydrophobic, hydrophilic .« acidic f basic,, neutral 
hydrophilic, small, and large. After selecting a binding 
domain, a subsequent variegation and sal action may be 

IS desirable to achieve a higher affinity or specificity, 
During this second variegation., amino acid possibilities 
overlooked by the preceding variegation may be 
investigated, 

in the second round of variegation, a preferred 

20 strategy is to vary each position through a new set of 
residues which includes the amino acid i s > which were 
found at that position in the successful binding domains, 
and which include as xsany as possible of the residues 
which ware excluded in the first round of variegation. 

2S Thus, later rounds of variegation test both amino 

acid positions not previously mutated, and amino acid 
substitutions at a previously .mutated position which were 
not within the previous substitution set. 

If the first round of variegation is entirely 

30 unsuccessful, a different pattern of variegation should 
be used. For example, if more than one interaction set- 
can be defined within a domain, the residues varied in 
the next round of variegation should be from a different 
set than that probed, in the initial variegation. If 

35 repeated failures are encountered t one may switch to a 
different IFBD, 
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Affinity separation is used initially in the present 
invention to verify that tbe display system is working, 
i , e , ,■ that a cMsserie outer snrface protein has been 
S expressed and transported to th& surface of the phage and 
is oriented so that the inserted binding domain is 
accessible to target material. men •used for this 
purpose, the binding domain is a known binding dosais for 
a particular target and that target is the affinity 

10 molecule need in the affinity separation process. For 
example, a display system my be validated by using 
inserting m& encoding- BPTX into a gene encoding an outer 
surface protein of the phage of interest, and testing for 
binding to anhydro trypsin, which is normally bound by 

15 BFTX, 

If the phage bind to the target, then we have 
confirmation that the corresponding binding domain is 
indeed displayed by the phage. phage which display the 
binding domain (end thereby bind the target) are 

20 separated from those which do net. 

Once the display system is validated, it is possible 
to use a variegated population of phage which display a 
variety of different potential binding domains, and use 
affinity separation technology to determine how well they 

as hind to one or ©ore targets, This target need not be one 
bound by a known binding domain which is parental to the 
displayed binding domains, i.e.. one may select for 
binding to a new target . 

For example, one may variegate a BPTI binding domain 

30 and test for binding, not to trypsin, but to another 
serine protease, such as hnsaan neutrophil elastase or 
cathepsin S, or even to a wholly unrelated target, such 
as horse heart myoglbbin. 

The term "affinity separation means* includes, but 

35 is not limited tot a) affinity eoluEin chromatography, b) 
batch elution from an affinity matrix material, c) batch 
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elytipsi from an affinity asaterial attached to a plate, d) 
fluorescence activated cell, sorting, and e) electrophor- 
esis Ik the presence of target material, "Affinity 
material* is used to mean a material with affinity for 
s the material to foe purified, called the "analyte". In 
most cases, the association of the affinity material and 
the analyte is reversible so that the analyte can foe 
freed from the affinity material once the impurities are 
washed away. 

10 If affinity chromatography is to foe used, them 

1} the molecules of the target material must be of 
sufficient size and chemical reactivity to be 
applied to a solid support suitable for affinity 
separation, 

IB 2) after application to a matrix, the target material 
preferably does not react with water, 
after application to a matrix, the target material 
preferably does not hind or degrade proteins in a 
non-specific way, and 

20 4} the molecules of the target arterial must be suffi- 
ciently large that attaching the material to a 



ally at least S00 k* , excluding the atom that is 
connected to the linker) for protein binding.' 
2S affinity chromatography is the preferred separation 

means, but FAGS, electrophoresis, or other means saay also 
foe used. 

The present invention make* use of affinity separa- 
tion ©f phage to enrich a population for those phage 
30 carrying genes that code for proteins with desirable 
binding properties. 

The present invention may foe used to select for 
.binding domains which hind to one or more target mater- 
ials, and/or fail to bind to one or more target 
35 materials. Specificity, of course, is the ability of a 
binding molecule to bind strongly to a limited set of 
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target materials, while binding mora weakly or not at all 
to another sat of target materials from which the first 
sat. mast fee distinguished* 

Almost, aay molecule that is suitable for affinity 
5 separation may be used as a target. Possible targets 
include, .but are not limited to peptides, soluble and 
insoluble proteins, nucleic acids, lipids, carbohydrates, 
other organic molecules fmonomeric or polymeric), 
inorganic, compounds, and orgaaometaiiic compounds. 

10 Serine proteases are an especially interesting class of 
potential target materials. 

For chromatography, FAGS, or electrophoresis there 
may fee a need to eovalently link the target material to 
a second chemical entity. For chromatography the second 

IS entity is a matrix for W&CS the second entity is a 
fluorescent dye, and for electrophoresis the second 
entity is a sts scale. In. many cases , no 

coupling is required because the target laateriaX already 
has the desired property off a) immobility, b) f lucres- 

20 pence, or c) charge, Xn other cases, chemical or 
physical coupling is required- 
It is not necessary that the actual target material 
be used in preparing the imebtltzed or labeled analogue 
that is to be used in affinity separation; rather, 

25 suitable reactive analogues of the target, material may hs 
more convenient. Target materials that do not haw 
reactive functional groups may ha immobilized by first 
creating a reactive functional group through the use of 
some, powerful reagent, such as a halogen. In some cases, 

30 the reactive groups of the actual target material may 
occupy a part on the target molecule that is to toe left 
undisturbed- In that case, additional functional groups 
may toe introduced by synthetic chemistry- 

Two very general methods of immobilisation are 

35 widely used. The first is to biotinylate the compound of 
interest and then bind the biotlnylated derivative to 
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immobilized avidin. The second method is to generate 
antibodies to the target material, immobilize the anti •• 
bodies by any of numerous methods, and then bind the 
target material to the immobilised antibodies. Use of 
S antibodies is more appropriate for larger target materi- 
als; stall targets { those comprising, for example , ten or 
fewer non- hydrogen atoms) may be so completely engulfed 
by an antibody that very little of the target is exposed 
in the target -antibody complex. 
10 8©n~covmle»t immobilization of hydrophobic molecules 

without resort to antibodies amy also be used, A com- 
pound, such as 2 f 3 , 3-trimethyidecan© is blended with a 
matrix precursor, such as sodium alginate, and the 
mixture is extruded into a hardening solution* The 
1.5 resulting beads will have 2,3, S-rrimethyldecane dispersed 
throughout, and exposed on the surface. 

Other immobilization methods depend on the presence 
of particular cbemieal functionalities, A polypeptide 
will present -m^ m- terminal; Lysines), » »COOH fcCHWS- 
20 minai? Aspartic Acids; Glutamic Acids) > -OH (Serines; 

©5 , and -SH (Cysteines) . A polysac- 




SSS" Derivatives"™'©? 2,4,6-trinitro benzene 

30 sulfonates (TUBS) ,- p. IX) 

r-HSU Carboxylic acid anhydrides, sua*.. 

derivatives of succinic anhyd 
ami sic anhydride, citraconic anhydride 
35 ^CREXM, p. 11} 

r-HB, Aldehydes that form reducible Schiff 

p . 12 } 

cyclohexaneaione derivatives CCRSI84, 
40 p. 14} 
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E-COjH Diazes cmpde {CKRX84> p. ID) 

R-CCV Epoxides (CRBXB4, p. 3.0) 

r-OH Carhoxylic acid anhydrides 

Axyl-OH Cartooxriio acid anhydrides 



Bsnsvi hali&e and suXieuyl h&lides 
(002X84, 



p. 19} 

R-SH »--alkyliaal«imide8 (CREI84, p. 21) 

r-SH ethylenaimiaa derivatives CCREX84, 

fi. 21} 



K-SH 

Thiol ethers 



Aldehydes Oxidise to COOH, aflfls. 

MO^K convert so R-S0jCi s 




30 R-POjH Concert to R-PC\C1 and react with 

ixnmobilisied alcohol or amine. 

CC double bonds Md HSr and then make amine or thiol . 

35 

The extensive literature on affinity chromatography 
and related techniques! will provide further examples. 

Matrices suitable for use as support materials 
include polystyrene, glass, agarose and other chrosiato- 

40 graphic supports, and w&y fee fabricated into beads, 
sheets, columns , wells, and other Sanaa as desired. 
Suppliers of support material for affinity chromatography 
include : Applied Protein Technologies Cambridge, Ma,* 
Sio-R&d Laboratories, RockvilXe Center, KY? Fierce 

m Chemical Gasxpmt* Eodkford, IS.*. Target materials are 
attached to the matrix in accord with the directions of 
the manufacturer of each matrix preparation witfe 
consideration of good presentation of the target. 
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arly in the selection process , relatively high 
of target materials may be applied to the 
matrix to facilitate bia&Lag-i target concentrations may 
subsequently be reduced to select for higher affinity 

The population of display phage is applied to an 
affinity matrix under conditions compatible with the 
intended use of the binding protein and the population is 
f ractionated by passage of a gradient of some solute over 

10 the column. The process enriches for PBDs having 
affinity for the target and for which the affinity for 
the target is least affected by the eluants used. The 
enriched fractions are those containing viable display 
phage that elute from the column at greater concentration 

IS of the eluant . 

The eiwants preferably are capable of weakening 
noncovalsnt interactions between the displayed and 
the immobilised target material* Preferably, the eiuants 
do not kill the phage? the genetic message corresponding 

20 to successful mini-proteins is most conveniently 
amplified by reproducing the phage rather than by in 
vj&ca procedures such as PCE. The list of potential 
eluante includes salts (including Ha*, SO*- - # 

HjP0. r , citrate, K+, hit., Cs4, H$0 4 ~ , CQ r ~ , C*+*.» 3**+.* 

2S P0 4 ---., H€0 3 -, -Ms**, Ba*+, Br~ , HP0 4 - ~ and acetate } f 

acid, heat, compounds known to bind the target, and 
soluble target material (or analogues thereof) . 

Heutral sdiufee, such as ethanol, acetone, ether, or 
urea, are frequently used in protein purification and are 

30 knoTO to weaken non-covaient interactions between 
proteins and other molecules. Many of these species ere, 
however, very harmful to bacteria ana bacteriophage. 
Urea is known not to harm Ml 3 up to 8 M. Salt is a 
preferred solute for gradient formation in most cases. 

35 Decreasing pH is also a highly preferred eluant. In some 
cases, the preferred matrix is not stable to low pH so 
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that salt and mm the most preferred reagents. 

Other solutes that generally weaken saon-covalerit interac- 
tion between protein® and the target jsaterial of interest 
may also be used. 
S The uneluted display phage contain Dm encoding 

binding doiB&ins which have a sufficiently high affinity 
for the target material to resist the elution conditions * 
The im ©siooding such successful binding domains may foe 
recovered in a variety of ways. Preferably, the bound 

10 display phage are simply eluted by ssaans of a change in 
the elution conditions. Alternatively., one may culture 
the phage is situ, or extract the target -containing 
matrix with phenol Cor other suitable solvent) and 
ampiif y the » by PCE or by recombinant Dm techniques . 

15 Or, if a sits for a specific protease has been engineered 
into the display vector, the specific protease is used to 
cleave, the binding domain from the G£. 

Variation in the support material (polystyrene, 
glass, agarose, cellulose, sfc&J in analysis of clones 

20 carrying SBDs is used to distinguish phage that bind to 
the support material rather than the target.. 

The harvested phage are now enriched for the 
binding- to- target phenotype by use of affinity separation 
involving the target material immobilized on an affinity 

as matrix. Phage that fail to bind to the target material 
are washed away. It may be desirable to include a 
bacteriocidal agent, such as aside, in the buffer to 
prevent bacterial growth. The 'buffers xxsed in 
chromatography include: a) any ions or other solutes 

30 needed to stabilize, the target, and b) any ions or other 
solntes needed to stabilise the SBDs derived from the 

XPBD* 

Recovery of phage that display binding to an 
affinity column is typically achieved by collecting 
35 fractions eluted from the column with a gradient of a 
chaotropic agent as described above, or of the target 



won/mm 



PCF/US92/01539 



SI 



material in soluble form; fractions elating later in the 
gradient are enriched for high-affinity phage. The 
elufced phage are then amplified lis suitsJble host cells. 
If some high- affinity phage cannot be eluted from 
5 the target in viable form, one aafr 

1} flood the matrix with a nutritive medium and grow 

the desired phage M fiitth 
2} remove parts of the matrix and its© them to inoculate 
growth medium, 

10 3} chemically or sazymatically degrade the linkage 
holding the target to the matrix so that GPs still 
hound to target are eluted, or 
4} degrade the phage and recover PH& with phenol or 
other siiitahle solvent; the recovered fM& is used to 
IS transform cells that regenerate ms« 

It ie possible to utilise combinations of these methods , 
It should be remembered that what we want to recover from 
the affinity matrix is not the phage pax ftS* hut the 
information in them as to the sequence of the successful 
20 epitope or binding domain. 

As described in WO9S/02S0S., one may modify the 
affinity separation of the method described to select a 
molecule that binds to material A but not to material 8, 
or that binds to both A and S at competing or 
25 noncompeting sites, or that do as& bind to selected 
targets . 

Using the method of the present invention, we can 
obtain a repli cable phage that displays a novel protein 

30 domain having high affinity and specificity for a target 
material of interest. Such a phage carries both amino - 
acid embodiments of the binding protein domain and a BM& 
embodiment of the gene encoding the novel binding domain. 
The presence of the » facilitates expression of a 

35 protein comprising the novel binding protein domain 
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within a high- level expression system, »Mch need not be 
the same system used during the &e-> * tal process. 

We can proceed to production of tfce novel binding 
protein in several ways, including : a] altering of the 
5 gens encoding the binding domain so that the binding 
domain is expressed as a soluble protein, not attached to 
a phage (either by deleting codons S« of those encoding 
the binding domain or by inserting stop eodons 3* of 
those encoding the binding domain) , b) swing the DMA 
10 encoding the binding domain into a known expression 
system,, and c) utilising the phage as a purification 
system, (If the doaia is small enough, it may be 
feasible to prepare it by conventional peptide synthesis 
methods,} 

IS As previously mentioned,- an .advantage inhering from 

the use of a mini-protein as an XP&b is that it is likely 
that the derived SBD will also behave like a mini -protein 
and will be ofefeainable by means of chemical synthesis. 
(The term « chemical synthesis*, as used herein, includes 

M the use of enzymatic agents in a cell -free environment, 5 
Peptides may be chemically synthesized either in 
solution or on supports. Various combinations of 
Stepwise synthesis and fragment condensation may be 
employed, 

25 Caring synthesis, the amino acid side chains are 

protected to prevent branching. several different 
protective groups are useful for the protection of the 
thiol groups of cysteines s 

1) 4 -methoxybearyl (MBssl? Mob) (KXSH82j KUmSS) , 
30 able with HF; 

2) acetamidomethyl {Aaa) {HXSH82 ? HXSESS? BSCK89C) , 
removable with iodine; mercury ions (e.g. . mercuric 
acetate) t silver nitrate? and 

3 ) s « para~metho«^bensyi . 
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Other thiol protective groups my be found in 
standard reference works such as Greene, P&GTECTXYE 
GROUPS IN ORSJWIC SYHTEEB2B {13&1} . 

Once the polypeptide chain has been synthesized, 
5 disulfide bonds must be formed. Possible oxidizing 
agents include air (KXSH86) f ferrieyanide (HXSH82) , 
iodine {HXSH82}, and performic acid. Temperature, pH, 
solvent,, and chaotropic chemicals say affect the course 
of the oxidation. 

10 A large number of mini -proteins with a plurality of 

disulfide bonds have been chemically synthesized is 
biologically active form* 

The successful binding domains of the present 
invention my, alone or as part of a larger protein* foe 

XS used for any purpose for which binding proteins are 
suited, including: isolation or detection of target 
materials. Xn furtherance of this purpose, the novel 
binding proteins may be coupled directly or indirectly, 
covalently or noncovalently , to a label, carrier ox 

20 support- 

mm used as a pharmaceutical, the novel binding 
proteins xsay be contained with suitable carriers or 

***** 

2S All references cited anywhere in this specification 

are incorporated by reference to the extent which they 
may he pertinent, 
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All calls used la the following examples are jSffiU. 
cells, 

JtXS&UX OF BRI AS A FagXOSf TO MX3 SOTS VXXI PROTOXW* 

5 Example l involves .display of BPTI on M13 as a 

fusion to the mature pie Vlll coat protein. Bach 
construction was confirmed by restriction digestion end 
DMA sequencing. 

1. Construction o f t he y j. j i - 9±sudSJLz. 

The operative cloning vectors are MI3 and pbagremids 
derived from M13. The initial, construction was in the 
£l~foassd phagemid pGBM~32£ CPromega Corp., Madison, 

WX.), 

IS We constructed a gene encoding, in order * ; i> a 

modified lacUVS promoter, ii) a Shine-Dalgarno sequence, 
iii) MX3 gene TO I signal sequence, iv) stature BPTX, v) 
ma.ture<-M13~gene~3?III coat protein, vi) multiple stop 
codons, and vii) a transcription terminator. This gene 

2Q is illustrated in Table 102. The operator of l&s&Sffi is 
the syKsaetrical lacO to allow tighter repression in the 
absence of IPTG. The longest segment that is identical 
to wild-type gene yxxx is minimised so that genetic 
recombination with the co- existing gene ffil is unlikely. 

25 i) OCW based upon pGEM-3££* 

pSm-SBf®** (Proraega Corp., Madison, HI.) is a vector 
containing the §m gene, bacterial origin of replication, 
bacteriophage fl origin of replication, a imst operon 
containing a multiple cloning site sequence, and the T7 

30 and SP6 polymerase binding sequences. 

BamHX and Sail sites were introduced at the 
boundaries of the lacZ operon {to facilitate removal of 
the laej operon and its replacement with the synthetic 
gene) ? this vector is named pGEM~MS3/4 „ 

:3S m OCf based upon K13flfpX8. 
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Ml3mpX8 {YrniSS} is a vector (New England Biolabs, 
Beverly, m..) consisting of the whole of the phage genome 
plus a operon containing a jaultiple cloning site 

(HESS77) « BasrtHX and Sail sites were introduced into 

5 HI 38^X8 at the 5* and 3' ends of the lasS operon? tMS 
vector is named M3.3~HBi/2. 
m Synthetic <3sne. 

A synthetic gene Cmij^.teal....g.g»&aa; tmaiSX&kL 
^jMp^.vm.eQat.otQtein) was constructed from IS 
10 synthetic oligonucleotides, synthesized by Genetic 
Designs Inc. of Houston, Texas., vim a method similar to 
those in KXKH89 and ZSSmS* Table 102 contains an 
annotated version of this sequence. The oligonucleotides 
were phosphorylated f with the exception of the 5' most 

XS molecules, using standard methods, annealed and ligated 
in stages. The overhangs were filled in with T4 vm 
pplyjaerase and the DNA was cloned into the Eiscil site of 
pGWh38*<->; the initial construct is pSEM~»x. Double- 
stranded m& of pSHM-HBl was cut with ggjfcX, filled in 

20 with T4 m& polymerase and ligated to a ML* linker {Hew 
England Hiobabs} so that the synthetic gene is bounded by 
Bain: and g^l sites (Table 3.02 K The synthetic gene was 
obtained on a g§©HX-£aIX cassette and cloned into psm~ 
MB3/4 and JS13-MB1/2 using the introduced BjusHT and Sail 

25 sites, to generate p£EM~»€ and SEL3-HB15, respectively. 
The synthetic insert was sequenced. The original 
Einosame Binding Site IRBS) was in error lM3$m instead 
of the designed AGGRGS) and we detected no expressed 
protein is v£££2 and is vi^ro, 

30 C) Alteration* to the synt&etie gene, 
i) Eifeosome binding sit® (BBS) . 

Xn pGBM-MBl6, a MI^SS? i» fragment (containing 
the IBS) was replaced with an oligonn elect ide encoding a 
new RBS very similar to the RBS of E, saLL » that is 

35 known to function. 
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Original putative SBS {S'-to-S*} 

GTTCTT^silGG w s AG C 



IS The putative BBSs are lower case end the initiating 
methionine codon is underscored end hold. The resulting 
construct is pGEM~MB20 . Xtl ^i&XS expression of the gene 
carried by pSBM~M820 produced a novel protein species of 
the expected size, about 14. S kdU 

TO obtain higher expression of the fusion protein, 
the ;Us3SZS promoter was changed to a promoter. In 
pGSM-MBi€f a BamHI-tolX fragment (containing the lacmrs 
promoter) was replaced with an oligonucleotide containing 
20 the -35 sequence of the &m promoter i£f RUSSS2 } 
converting the Laj | - omoter to fcajs. The vector is 



HB16" 5-'- G& cttcCcg 
2S gctcg. . ~3 4 

3 s - G ceg aaatgt gaa&tacgaag 

gc{cgagc. . - 5 ' 

j i juasu. x—i 

3D 




V Ills . p , •• BFTI -aatursVIXX 
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The synthetic genes from p<3m-MB20 and p6KM~MB26 were 
rationed into the altered phage vector M13-MB1/2 to 
generate the phage M13-MB2? and Ml 3 -H828 respectively, 
Ui. Signal Fepfei&e jtegpuKte*. 
5 Is 3d&£S esqsresaiosi of the apathetic gene regulated by 
tae and the "new* ESS proceed a novel protein of the 
expected else for the unprocessed protein m) , m 

ylvo. expression also produced novel protein of full sisse? 
no. processed protein could foe seen on phage or in cell 

10 extracts by silver staining or by Western analysis with 
ant i - BFTI antibody . 

Thus we analysed the signal sequence of the fusion. 
Table 106 shows a number of typical signal sequences. 
Charged residues are generally thought to foe of great 

IS importance and are shown bold and imflarasfflg^* ^ach 
signal sequence contains a long stretch of uncharged 
residues that are mostly hydrophobic? these are shown in 
lower case. At the right, in. parentheses, is the length 
of the stretch of uncharged residues. We note that the 

20 fusions of gene Mil signal to BFTX and gene III. signal 
to BFTI have rather short uncharged segments. These 
short, uncharged segments may reduce or prevent processing 
of the fusion peptides. We know that the gene III signal 
sequence is capable of directing; a) insertion of the 

25 peptide comprising {mature -BFTI) % i Caiature-g ene»XX I- 
protein} into the lipid bilayer, and fo) translocation of 
SPTX and most of the mature gene 111 protein across the 
lipid bilayer (vide infra) . That the gene 111 remains 
anchored in. the lipid bilayer until the phage is 

30 assembled is directed by the uncharged anchor region near 
the carhoxy terminus of the siature gene III protein {see 
Table 116.) and not by the secretion signal sequence. The 
pho& signal sequence can direct secretion of mature BFTI 
into the periplasm of 1*. coll {M&RK86} - Furthermore, 

3S there is controversy over the mechanism by which mature 
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authentic gene VXXI protein cornea to be in the lipid 
- biiayer prior to phage assembly. 

Thus we replaced the J3m coding for the gena-yXXX* 
puoative-signal -sequence by each of coding for; X) 

5 the phoA signal sequence, 2} the bla signal sequence, and 
3} the M13 gene xxx signal, Each of these replacements 
produces a tripartite gene encoding a fusion protein that 
comprises, in order ; (a) a signal peptide that directs 
secretion into the periplasm of parts (b) and Co) * 

10 derived from a first gene? (h) an initial potential 
binding domain (BFZT in this ease} , derived from a second 
gene (in this case,- the second gene is an ffiiaal gene) ? 
and (e) a structural packaging signal {the mature sens 
VTZZ coat protein} derived from a third gene. 

IS -mm process toy which t«e 1PBD ^packaging- signal fusion 
arrives on the phage surface is illustrated in Figure 1. 
in Figure la, we see that authentic gene VXXX protein 
appears {by whatever process) in the lipid biiayer so 
that both the amino and carfcoasy termini are in the cyto- 

20 plasm. Signal peptidase- l cleaves the gene ¥XXX protein 
liberating the signal peptide (that is absorbed by the 
cell) and mature gene YXXT coat protein that spans the 
lipid biiayer, Many copies of mature gene ¥111 coat 
protein accumulate in the lipid biiayer awaiting phage 

25 assembly (Figure lc> * Some signal sequences are able to 
direct the translocation of quite large proteins across 
the lipid biiayer. Xf additional code-as are inserted 
after the codons that encode the cleavage site of the 
signal peptidase- X of such a potent signal sequence, the 

30 encoded amino acids will toe translocated across the lipid 
biiayer as shown in Figure lb. after cleavage by signal 
peptidase- l f the amino acids encoded by the added codons 
will be in the periplasm but anchored to the lipid 
biiayer toy the mature gene VXXX coat protein, Figure id. 

3S The circular single-stranded phage PHA is extruded 
through a part of the lipid biiayer containing a high 
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concentration of mature gene vxil coat protein? the 
carboxy terminus of each coat protein molecule packs near 
the Dim while the amino terminus packs on the outside. 
Because the fusion protein is identical to mature gene 
5 VXXX coat protein within the trana-bilayer deasaisJ, the 
fusion protein will ca~smm&stol* with authentic stature 
gene ¥1X1 coat protein as shown in Figure le. 

Xn each case, the mature ¥1X1 coat protein moiety is 
intended to co-assemble with authentic mature VIXX coat 

10 protein to produce phage particle having BFTI domains 
displayed on the surface, The source and character of 
the secretion signal sequence is not important because 
the signal sequence is cut away and degraded. The 
structural packaging signal, however t is quits important 

IS because it must co-assemble with the authentic coat 
protein to make a working virus sheath, 
a) BaateriaX Mkaliae Fhosphatasa ffti) Signal Peptide. 

Construct p3m~UBZ€ contains a JS&£X-&££XXI fragment 
containing the new SBS and sequences encoding the 

20 initiating methionine and the signal peptide of HX3 gene 
VIII pro-protein. This fragment was replaced with a 
duplex (annealed from four oligo-nts) containing the BBS 
and Dim coding for the initiating methionine and signal 
peptide of PhoA i,im\W2} t phage is pGBM-MB42 . M13MB48 is 

2S a derivative of GeraMB42. A Mm^-BMM Pm fragment from 
©enM842 f containing the gene construct, was Heated into 
a similarly cleaved vector MEL3MB1/2 giving rise to 
M13MB4S. 

moh ms and signal peptide sequence 
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5 ' ~ G^SC^CCATGGCS^^imATAAA . ATS . M . CAA . AGS . ACS - - 

j saeXj .3»sib Xys gin ear fchr 

. ATC . GCA . CTC . TTA» CCS , TTA , CTG . TTT . ACO . OCT . GTS , ACa . - 
lie aia leu leu pro leu leu phe thr pre val thr 

. ASA.GCC , CGT . CCS. GAT * - 3 * 
Xys aia arg pro : 



. arg pro asp 
lAccIII.j 



To allow transfer of the j? lactamase ism) promoter and 
DNA coding for the signal peptide into the (mtw.- 
BFfX} ■.! s {3sature -¥XXX~coat-protein5 ^®ne, we f irst 
introduced art ascXXl site the gg& adjacent to the 

eodone for the 8-lwB&mm signal peptide cleavage site 
WwP& 3W"6J? vector is pGSM-MB40. We then ligated 
a BaxsHX linker into the ^£11 site at nt 2260 , 5* to the 
promoter; vector is p»!~&SB45. The MlS-HX^SK-XXX fragment 
no*? contains the s» promoter, ajma BBS* initiating 
methionine and lactamase ..signal peptide. This fragment 
was used to replace the corresponding fragment from pGBK- 
pGEK-MB46, 




5 5 - GSATCCGGTSGCACTTr 

WIG . &GT . ATT . Cm. CAT, TTC. O3T.0TC.GCC . CTT , ATT . CCC.TTT . TXT . • 
saet gsr ile <xln his phe arg val aia leu lie pro phe 

phe 

GCG . GCA > TTT . TGC , CTT . CCT.GTT , TTT . SOT . CAT . CCG .-3 s 
aia aia phe eys leu pro val phe aia hie pro. . - « 

c) m£zsm*~ XXX signal t shptls s ;^»^li^il3»lMs 

We amy also construct MX3-MB51 which would carry a Km® 
gene and a m^m^JM^Mmm3^mQSM& g«a« f ragment 
fused, to the previously described bp u?-1I$Z.z&£&&z. 
protein gene fragment. Because M13~M8SX contains no gene 
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xyr r the phages can not. form plaques, but can render cells 
Xafectious phage particles can be obtained via 
helper phage. The geae XLL signal sequence is capable of 
directing tBPTI) s ; Csnatinta-gene- III -protein) to the 
5 surface of phage (vMe_infm) . 
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2. Anal ysis of the Protein j ergfltt s . fc g Flooded M -jm 

Synthetic (signal- pep ti<tet ? iia£ji^r^i^ j zlii^^ fe . 

20 i) Is Zitrp analysis 

A coupled transcription/translation prokaryotic system 
fAmer sham Corp., Arlington Heights, XL) was, utilised tor 
the £s vitro analysis of the protein products encoded by 
the BPTX/VIIX synthetic gene and the derivatives. 

2.5 Table 107 lists the protein products encoded by the 
listed vectors which are visualised by standard 
fluorography following Is Y,&trQ synthesis in the presence 
of 35 s-3G?ethioni»e and separation of the products using 80S 
polyacryismi&e gel electrophoresis. In each sasople, a 

30 pre -^- lactamase product («3i kd) can be seen. This is 
derived froia the as© gene which is the common selection 
gene for the vectors. in addition, a {pre-BPTX/YXXl) 
product encoded by the synthetic gens and variants can he 
seen as indicated. The migration of these species (~X4.5 

35 kd) is consistent with the ejected size of the encoded 
proteins . 

iiJ I» Xlv^ analysis* 
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Thm vectors detailed in sections CB) asfi (C) were 
freshly transfected into the soil strain XLl-folue $TM3 
(Stratagene, l& Jolla, CA> and in strain SBF», &*. mil 
strain SS60O4 (&X8S85) carries t&e prXM mutation aad is 
5 more pexmissiv® in secretion than strains that carry the 
wild- type prlft . S8S004 is rand is deleted for MsX? tfous 
the cells can act foe infected hy M13 aad lasffi and lass 
promoters can not foe regulated with xw». Strain S1F' is 
derived from strain SSSO04 (LISS8S) by crossing with Xfcl- 

10 Blue^; the F ! in Xhi-Siue^ carries Tc* and lasIS 

004 is streptomycin 8 , Te s while XLI-Blue* 1 * is strepto- 
mycin** Tc* so that .both parental strains can be killed 
with the combination of To and streptomycin. SSF* 
retains the seeretion-pemiesiv^ phenotype of the 

IS parental strain, »804{&rlMl* 

The fresh transfectants were grown in B&SYCM medium 
(SAMS 8 9 } for 1 hour, IPTQ was added over the range 1,8 
pM to 0.5 m (to derepress lafiSHS and jfcas) and grown for 
an additional 1,5 hours. 

20 JMiquots of cells expressing the synthetic- insert 
encoded proteins together with controls (no vector, mock 
vector, and no XPTG) were lysed in BUS gel loading foufi fer 
and electrophoreses in 20% poiyacrylamide gels containing 
SPS and urea. Duplicate gels were either silver stained 

2S (DaiiChi, Tokyo, Japan) or eleetrotrausf erred to a nylon 
aatrlx (lasaobilon from Millipore, Bedford, Mfc) for 
was tern analysis using rabbit antl-SPTI polyclonal 
antibodies . 

Table 108 lists the interesting proteins seen by silver 
30 staining and western analysis of identical gels. We can 
see clearly by western analysis that IS?TG« inducible 
protein species containing BPTX epitopes exist in the 
test strains which are absent from the control strains. 
In XLl-Blus^, the migration of this species is predomin- 
3S antiy that of the unprocessed form of the pro-protein 
although a small proportion of the protein appears to 
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migrate at a sise consistent with that of a fully 
processed foa,: Ih 8SP' , the processed form 
predominates, their* being only a faint band corresponding 
to the unprocessed species . 
S Thus in strain. * we have produced a tripartite 
fusion protein that is specifically cleaved alter the 
secret ion signal sequence. We believe that the mature 
protein comprises BFTI followed by the gene VIII coat 
protein and that the coat protein moiety spaas the 

10 membrane . One or more copies, perhaps hundreds of 
copies, of this protein will co»a**«atol« into Ml 3 derived 
phage or M13~lika phagemids. This construction will 
allow us to a) mutagenics the 8£TX domain , h) display 
each of the variants on the coat of one or more phage 

IS lone type per phage) , and c) recover those phage that 
display variants having novel binding properties with 
respect to target materials of our choice, 

R&sched and Oberer (KASC86) report that phage produced 
in cells that express two alleles of gene 212, that have 

20 differences within the first IX residues of the mture 
coat protein, contain same of each protein. Thus, 
because we have achieved in. vivo; processing of the 
phoM signal) «. :fcp.fci; t^mturevni fusion geae, it is highly 
likely that eo- expression of this gene with wild- type 

25 VXSX will lead to production of phage bearing BPTX 
domains on their surface. Mutagenesis of the foj&i domain 
of these genes will provide a population of phage , each 
phage carrying a gene that codes for the variant of BPT1 
displayed on the phage surface. 

30 ¥XI1 display Phage * Production, Preparation and &nalyeis< 
i. Phage Produetioa. 

The OCV can be grown in XLl-Blne"** in the absence of 
XPTG. Typically, a plague plug is taken from a plate and 
grown in 2 «& of saedium, containing freshly diluted 

35' cells , for 5 to 8 hours . Following centrifugation of 
this culture, the supernatant Is titsred. This is kept 
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as a pfeage stock for further infection, phage production, 
and display of the gene product of interest, 

l. 100 -fold dilution of a fresh overnight culture of SEF f 
cell® in 500 ml of medium is allowed to grow to a 

5 cell. density of 0.4 iM> 6Mwm) in a shaker at 37*C. To 
this culture is added a sufficient amount of the phage 
stock to give a M01 of 10 together with XFSG to give a 
final concentration of 0.S sM. The culture is allowed to 
grow for a further 2 hrs* 

10 ii. Bhage Preparation and Purification, 

The phage-produeing bacterial culture is eentrifugad to 
separate the phage in the supernatant from the bacterial 
pellet. To the supernatant is added one quarter, toy 
volume, of phage precipitation solution (20% PEG, 3,75 M 

15 awanium acetate) and SUSP to a final concentration of 
ZsM* it is left on ice for 2 hours after which the 
precipitated phage is retrieved by cent rifugat ion. The 
phage pellet is redissolved in TrisEDTA containing 0,1% 
Sarkosyl and left at 4°C for x hour after which any 

20 bacteria and bacterial debris is removed toy centrifuge- 
tiou. The phage in the supernatant is reprecipitated 
with PEG overnight at 4*C„ The phage pellet is 
r® suspended in W> medium and repreciptated another two 
tiawfto remove the detergent. The phage is stored in LB 

25 medium at 4°C, ti tared, and used for analysis and binding 
studies. 

A more stringent phage purification scheme involves 
eentrifugation in a CsCl gradient 13. SS g of CsCl 
dissolved in m? buffer {0,1 M NaCi, isM Q.1M Tris 

30 pH 7,?} to jmke 10 ml) . 10 n to 10 53 phage in TE Sarkosyl 
buffer are mixed with 5 ml of CsCl MET buffer and 
centrifuges overnight at 34K rpm in, for example, a 
Sorvall 0TD-65B Ultracentrifuge . Aliquota of 400 *il are 
carefully removed. S pi aliqputs are removed from, the 

3S fractions and analysed by agarose gel electrophoresis- 
afrer heating at 6S«C for 15 minutes together with the 
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gal loading buffer containing 0.1% BBS , Fractions 
containing phage are pooled, the phage repreeipitated and 
finally redissolvea in LB medium to a concentration of 
W u to io w phage per ml. 
S iii. toga Analysis, 

The display phage are analysed using standard methods 
of poiyacrylamide gal electrophoresis and either silver 
staining of the gel or aieetrotransf er to a nylon matrix 
followed by analysis with ant i~ SOT antiserum {Western 

10 analysis) . Display of heterologous proteins is 
quantitated by comparison to serial dilutions of the 
starting protein, for example BIPTX, together with the 
display phage samples in the electrophoresis and Western 
analyses, An alternative method involves mnaing a 2~ 

15 fold aerial dilution of a phage in which both the major 
coat protein and the fusion protein are silver stained. 
Comparison of the ratios of the two protein species 
allows one to estimate the number of fusion proteins per 
phage since the number of VIII gene encoded proteins per 

20 phage C*-3Q00} is known, 

3&sasaa£iy^^ 

la vivo expression of the processed SOT o VII I fusion 
protein, encoded by vectors GemM842 and KX3K843., 
indicated that the processed fusion product is probably 

25 located within the cell membrane. Thus, it could he 
incorporated into the phage and that the BPTI moiety 
would be displayed at the phage surface. 

SEP 5 cells were infected with either M13MB48 or M!3mpl8 ( 
as control « The resulting phage were elattrophoresad {** 

30 10" phage per lane) in a 20% polyacrylaadde gel 
containing urea followed by eleetrotranafer to a nylon 
matrix and western analysis using snti - BPTI rabbit serum, 
& single species of protein was observed in phage derived 
from infection with the MX3MB4S stock phage which was not 

3S observed in the control infection* ^his protein migrated 



at an apparent size of *12 &d, cone latent with that of 
the fully processed fusion protein. 

Western analysis of SEP* bacterial lysate with or 
without phage infection demonstrated another species of 
protein of about 20kd, This species was also present, to 
a lesser degree , in phage preparations which ware simply 
PEG precipitated without further purification (for 
example, using nonionie detergent or by CsCl gradient 
centrifugation} , A comparison of M13MB48 phage 
preparations made in the presence or absence of detergent 
aldamonstratsd that sarkosyl treatment and CsCl gradient 
purification did remove the bacterial contaminant while 
having no effect on the presence of the BPTXsVXXX fusion 
protein. This indicates that the fusion protein has been 
incorporated and is a constituent of the phage body. 

The time course of phage production and SBTXsVttX 
incorporation was followed post -infect ion and after IPTG 
induction. Phage production and fusion protein 
incorporation appeared to be maximal after two hours. 
This time course was utilised in further phage 
productions and analyses. 

Polyacrylamide electrophoresis of the phage prepara- 
tions, followed by silver staining, demonstrated that the 
preparations were essentially free of contaminating 
protein species and that an extra protein hand was 
present in M13MS4S derived phage which was not present in 
the control phage. The si.se of the new protein v?as 
consistent with that seen by western analysis. A similar 
analysis of a serially diluted BPTIcVIII incorporated 
phage demonstrated that the ratio of fusion protein to 
xsajor coat protein was typically about uiSO. Since the 
phage contains about 3000 copies of the gene VXXX 
product, the phage population contains, on average, 10' s 
of copies of fusion protein per phage. 
Altering the initi&fcisg methionine of the aat 
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The OCV M13MS48 contains ths synthetic gene encoding the 
BFTCtVIli fusion protein in the intergenic region of the 
modified MX3.mplS phage vector. The remainder of the 
vector consists of the mm genome. To increase the phage 
5 incorporation of the fusion protein., we decided to 
diminish the production of the natural gene VXIZ product 
by altering the initiating methionine codon of this gene 
to em in such cases, methionine is incorporated, but 
the rate of initiation is reduced. The change was 
10 achieved by site -specific oligonucleotide mutagenesis. 

M X X S -rest of VIII 
ACT . TCC . TC .MS . . M0 . TCT . 
rest of XX - T S 3 stop 

15 

Site -specific mutagenesis. 

(D K x s -rest of 

VIII 

20 ACT * TCC , AG . C1t3 . MJi « « TCT . 

rest of XI - T 8 S stop 

Analyses of the phage derived from this modified vector 
indicated that there was a significant increase in the 
25 ratio of fusion protein to major coat protein. 
Quantitative estimates indicated that within a phage 
population as much as 100 copies of the BPTIsVIXI fusion 
were ino i mage . 

display of SFTXsVXIX fusion protein by bacteriophage. 

30 The BPTXiVXIX fusion protein had been shows to be 

incorporated into the body of the phage. This phage was 
analyzed further to demonstrate that the BPTX moiety was 
accessible to specific antibodies and hence displayed at 
the phage surface > 

35 We added purified, polyclonal rabbit anti-SPTX XgC to a 
known titer of phage. Following incubation, protein A- 
agarose beads are added to bind IgG and left to incubate 
overnight. The IgG -protein A beads and any bound phage 
are removed fey. cent rifugat ion followed by a ratitering of 

40 the supernatant to determine loss of phage. The phage 
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bound to %3m beads can also be acid elnted and tit«red. 
The assay includes controls, such as a «F phage stock 
CMXJmpXS) and IgG purified from normal rabbit pre -immune 
serum. 

5 Table 140 shows that wMls the titer of the WT phage is 
unaltered by auti-SPTX igQ, BFTl-SOME (positive control, 
vice 4a£m) < demonstrated a significant drop in titer 
with or without the extra addition of protein A beads. 
(Hate, since the BPTX moiety is part of gXXIp that binds 

10 phage to bacterial pili, this is expected,) Two batches 
of M13MS48 phage (containing? the BSTXtVIXX fusion 
proteinj demonstrated a significant reduction in titer, 
as judged toy pfus, when anti-BPTX antibodies and protein 
A beads were added. The initial drop in titer with the 

IS antibody alone, differs soajewhat between the two hatches 
of phage. Retrieval, of the i»nopracipitate& phage, 
while not quant i tat ive, was significant when compared to 
the Wf phage control. 
Further controls are shown in Table 141 and Table 142. 

20 The data demonstrated that the loss in titer observed for 
the BPTX?VXXX containing phage is a result of the display 
of BFTI epitopes by these phage and the specific 
interaction with anti-SPTX antibodies, m significant 
interaction with either protein A agarose beads or igG 

25 purified from normal rabbit serum could be demonstrated. 
The larger drop in titer for MX3M848 batch five reflects 
the higher level incorporation of the fusion protein in 
this preparation. 

Functionality of the BFTX moiety in the display 
30 phage. 

The previous two sections demonstrated that the 
B£TX*¥XXX fusion protein has been incorporated into the 
phage body and that the BPTX moiety is displayed at the 
pliage surface. To demonstrate that the displayed 
35 molecule is functional, binding experiments were 
performed in a saauner almost identical to that described 
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in the previous section «aceept that proteases vera used 
in pl&m of antibodies. The display phage and controls 
are allowed to interact with isnmobiliaed proteases or 
immobilized Inactivated proteases. Binding is aaeassod 
5 by the loss in titer of the display phage or by 
determining the phage bound to the heads . 

Table 143 shows the results of an experiment in which 
BFFX.VXXX display phage, M13MB48, were allowed to hind to 
anhydrotrypein-agarose beads, There was a significant 

10 drop in titer compared to WT phage (no displayed BSTX) .. 
A pool of phage Pool) , each contain a variegated 5 

amino acid extension at the SPTItmajor coat protein 
interface, demonstrated a similar decline in titer. In 
control {table 143} , very little non-specific binding of 

XS the display phage was observed with agarose beads 
carrying an unrelated protein (streptavidin) , 

Actual binding of the display phage is demonstrated by 
the data shown for two experiments in Table 144, The 
negative control is M13wpl8 and the positive control is 

2.0 BPTX-XXXMK, a phage in which the BFTX moiety, attached to 
the gene III protein, has been shown to be displayed and 
functional. M13MB4S and M13MB56 both bind to 
anhydrotrypsin beads in a aaanaer comparable to that of 
the positive control, being 40 to 60 times better than 

2S the negative control (non-display phage) . Hence, 
functionality of the SPTX moiety, in the major coat 
fusion protein, was established, 

Furthermore, Table .145 compares binding to active and 
inactivated trypsin by phage. The control phage, M13mpX8 

30 and BPTl-IXl MK, demonstrated binding similar to that 
detailed elsewhere in the present application. Mote that 
the relative binding is enhanced with trypsin due to the 
apparent marked reduction in the non-specific binding of 
the wr phage to the active protease. Ml 3, 3X7 and 

35 HI 3 ,3X11, which each contain 'EGGGS' linker extensions at 
the domain interface, bound to anhydrotrypsin and trypsin 
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in a fswm&r similar to BPTX-IIIMK phage. TSx& binding, 
relative to rum- display phage* «s *«X0Q-foXd higher i» 
the anhydrotrypsin feindixtg assay and at least 1000-fold 
higher in the tripsin bidding assay. The binding of 
another 'BOSGS' linker variant $Ml3.33Ea> was similar to 
that of M13.3X7. 

To demonstrate the specificity of binding the assays 
were repeated with human, neutrophil elast&sa CHHBJ beads 
and compared to that seen with trypsin beads Table 146. 
8m has a very high affinity for trypsin and a low 
affinity for wm, hence the BPTX display phage should 
reflect these affinities when used in binding assays with 
these beads. The negative and positive controls for 
trypsin binding were as already described above while an 
additional positive control for the HNS heads,, 
.BPTI (KlSbiVttm) ~x£x m was included. The results, shorn 
in Table 146, confirmed this prediction. M13MB48, 
MX3»3x? and MX3»3Xii $>hage demonstrated good binding to 
trypsin, relative to W phage and the HSIB control 
{ BPTI { K1S h , M»M3 } - XX I MM, being comparable to BPTX-XIXMK 
phage. Conversely poor binding occurred when HKE beads 
were used,, with the exception, of the KSB positive control 



Taken together the accumulated data toons traced that 
when BPTX is part of a fusion protein with the ssajor coat 
protein of M13 phage., the molecule is both displayed at 
the surface of the phage and a significant proportion of 
it is functional in a specific protease binding jsaxmer. 
— *** — 

CONaimu'CTXOH OF BPXX/SEBB-XXX BXSKE&Y 
Pffi manipulations were conducted according to standard 
procedures as described in Maniatis gjfc aJL. (M&NXB2 } or 
Sambrooh ,§t al*. {S»*©89} - First the l afigf, gene of MX3- 
MBX/2 was removed. M13-MB1/2 HP was cut with BaatHI and 
gall and the large fragment was isolated. Ti 
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bp fragment was filled in with Klenow enzyme, 
Xigated to a HiadXIX Sxner linker , and used to transfect 
Xbl-Bl\xeF** } ( stratagene f to Jolla f CA) cells which were 
subsequently plated for plague formation. RF DMA was 
S prepared from chosen plaques and a clone, MB.3-MB1/2- - 
delta, containing regenerated £aspi and Qgll s'ite» and a 
new lindXIT site, all 500 % upstream of the gglXI site 
(6935) , was picked. 
A unique 3Karl site wsvs introduced into codons l? and is 
10 of gene EX {changing the amino acids from H«S to <3*&» 
££. Table 110} in MX3-MBX/2- delta; 

13 14 15 16 1? 18 IS 20 21 
PFYSHS&ET 
S'-ct ttc tat tct can tec get gaa ac-3' wild- type 
15 3<~ga aag ata aga ccg egg cga ctt tg-5* 

mut&pmissing qligo 

S'-ct ttc tat tct qgc gee get gaa ac-3* matant 
V ? Y S G A ART 
The presence of a unique larl site at nucleotide imo was. 
20 eonf irssed by restriction ensyme analysis? the ne^ vector 
is m3-MBX/2~delta~NarX. Phage MK w ssade by cloning 
the 1.3 Kb SagHI Km 8 fragment from pl&emld pQC*t 
{ Pharmacia, Piseataway, NJ) into MX3 -MB1/2 -delta- Harl . 
Phage MR grows as well as wild- type MX3, indicating that 
25 the changes at the cleavage site of gene 1X1 protein are 
not deteetably deleterious to the phage. 

The BPTX-XXX expression vector was constructed by 

standard means. The synthetic fc©ti.~VXXI fusion contains 

30 a Marl site that comprises the last two eodons of the 

BPT2> encoding region, A second sari site was introduced 

upstream of the BPT1- encoding region by lighting the 

adaptor shown to &s.sXXX~eut MX3-MB2S: 

5 < -TATTCTGGCGCCCGT -3' 
3 S 3'- ATAASACCXBCGGGCAGGCC - 5 < 

jHarX j lAcelXX 



The ligation sample was then restricted with MaxX and a 
180 bp Dm fragment encoding 8PT! was isolated. RF DNA 
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of phage ME was digested with j«u&, dephosphorylated, and 
iigated to the 180 bp fragment. higation samples were 
used to traasfect m-Blue 0 * 9 which were plated for %$P 
plagues, mk, isolated from phage derived from plaques 
S was test for hybridisation to a. *F-phasphoryl&fced double 
stranded D8& probe corresponding to the BPTX gene. Large 
scale RF preparations were taade for doxies exhibiting a 
strong hybridisation signal. Restriction easjflae 
digestion analysis confirmed the insertion of a single 

10 copy of the synthetic BPT1 gene into gene ffi of MK to 
generate phage MK-BFTI. Subsequent DHA sequencing con- 
fined that the sequence of the m&zZU. tvsian gene is 
correct and that the correct reading frame is maintained. 
Table 116 shows the entire coding region, the translation 

15 into protein sequence, and the functional parts of the 
polypeptide chain. ^ ^ ^ 

MK-BPTX W SM& was added to a coupled prokaryotic 
transcription-transiation extract (Miershaa) . Newly 

20 synthesized radiolabelled proteins were produced and 
subsequently separated by electrophoresis an a 331 BBS- 
polyacrylaiside gel. The MK-BPTX DH& directs the 
synthesis of an unprocessed gene 1X1 fusion protein which 
is 7 m larger than the WT gene lit, consistent with 

25 insertion of SB amino acids of BPTX into gene III 
protein. We immunoprecipitated radiolabeled proteins 
from the cell -free prokaryotic extract. Neither rabbit 
anti{M13 -gene- VIXI-protein} IgS nor norml rabbit IgO 
were able to i^manoprecipitate the gene XXI protein 

30. encoded by either ME or MK-BSTX. However, rabbit 
ant 1- BPTX IgG is able to precipitate the gene III protein 
encoded by KK-BFTI but not by HE. This confirms that the 
increase in size of the XXX prctein encoded by m-BFTX is 
attributable to the insertion of the BPTX protein. 

3S 
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Phage ware recovered from cultures by PEG precipitation. 
To remove residual ceils, recovered phage were 
rasuspended Is a high salt buffer and centrifuges, as per 
instructions for the KPTA-aSNE^ MX3 is vitro tfetagenssis 
S Kit (Catalogue Number 170-3S7X, Bio -Sad, Richmond, C&) ♦ 
Aliquot a of phage (containing up to 40 jL*g of protein) 
ware eleetrophoresed on a 12.5% SOS « urea -poiyaeryiamide 
gal and proteins were .electro- transferred to a sheet of 
Xsmobilon. Western blots were developed using rabbit 

10 anti-BPTX serum, which had previously been incubated with 
an fiSli extract, followed by goat ant -rabbit antibody 
conjugated to alkaline phosphatase. An imuaoreac t ive 
protein of 67 £d is detected in preparations of the MK> 
BfcTX but not the MK phage. The sisse of the kamaaaxm* 

15 active protein is consistent with the predicted pirn of 
a processed BPTi-XIX fusion protein {6,4 Kd plus SO .$&}..» 
these data indicate that SPTT-spacif ic epitopes are 
presented en the surface of the MK-BFTI phage but not the 
m phage. 




Aahydro- trypsin is a derivative of trypsin having the 
active site serine converted to dehydroalanine . Anhydro- 
trypsin retains the specific binding of trypsin but not 

25 the protease activity. Unlike polyelonalantibodies, 
aahydro- trypsin is not expected to bind unfolded BPTX or 
incomplete fragments . 

Phage MR- BPTX and MK were diluted to a concentration 
1.4-10 i2 particles per ml. in TBS buffer (PARMS8) contain- 

30 ing 1,0 sng/ml B8&. 30 al of diluted phage were added to 
•2, S, or 10 pLl of a 50% slurry of agarose- ixamobilised 
auhydro- trypsin (Pierce Chemical Co. , Rockford, XL) in 
fBS/BSA buffer. Following incubation at 25*0., aliquot s 
were removed, diluted in ice cold LB broth and titered 

35 for plague- forming units on a lawn of Xbl-Blue*^ cells. 
Table 114 shows that incubation of the MK-BPTX phage with 
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issmobiliaed anhydro- trypsin results in a very signif icant 
loss in titer over a four hour period while no such 
effect is observed with the MK (control) phage. The 
reduction in phage titer is also proportional to the 
S amount of iEMohilisad anhyd.ro- trypsin added to the ME- 
BFTI phage. incubation with 5 pi of a SQ% slurry of 
agarose- iBimotoiXized etreptavidih {Slym, St, Louis, m) 
in TSS/BSA buffer does not reduce the titer of either the 
m-BZTX or MSC phage. These data are consistent with the 

10 presentation of a correctly- folded, functional BFTI 
protein on the surface of the M£~BFTX phage hut not on 
the MSC phase. Unfolded or incomplete BPTX dosnai&s are 
not ejected to bind anhydro. trypsin* Furthermore,, 
unfolded BFTI dasains are expected to he non- specifically 

IS sticky. 

UTOlM^iaXXON OF BHhSB TITER W2TH »X»BFTX &&TXB0BY 

MK-BPTI and HK phage ware diluted to a concentration of 
4.10 s plaque- forming units per asl in LB broth. 15 pi of 
diluted phage were added to an equivalent volume of 

20 either rabbit anti-BPTX serum or norasal rabbit serum 
(both diluted 10 -fold in LB broth) . Following incubation 
..at.37°C* aliguots were reeved, diluted by IS* in ice » cold 
LB broth and titered for pfius on a lawn of Xhi-Blue ffM) . 
incubation of the MR-BPTX phage with anti-BFTI serum 

25 results in a steady loss in titer over a two hour period 
while no such effect is observed with the MK phage, m 
ejected, normal rabbit serum does not reduce the titer 
of' either the NK-SPTX or the MK phage. Prior incubation 
of the anti-BFTX serum with authentic BPT1 protein but 

30 not with an equivalent amount of E*. £2ii protein, blocks 
tbe ability of the senna to reduce the titer of the HK- 
BPTI phage . These data are consistent with the 
presentation of BPTX- specific epitopes on the surface of 
rhe m-BJ>TX phage nut not the Sfx phage. More specif i- 

35 oally f the data indicates that these BBTX epitopes are 
associated with the gene XXX protein and that association 



PCFAJS92/01539 



of this fusion protein vitfc a» anti-SPTX antibody blocks 
its ability to mediate tb$. infection of cells. 

MK-BPT1 ana Hit phage *ere diluted to a concentration of 
S 4-10* plaque -forming units per ml in LB broth. Diluted 
phage were added to m equivalent volwse of trypsin 
diluted to various concentrations in LB broth. Following 
incubation at 37 0 c„ aliquots were removed, diluted by 10 4 
in ice cold 1*8 broth and titered for plaque -forming units 
10 on a lawn of XXsl-Biue^} , Incubation of the ME-BBTl 
phage with 0.15 jctg of trypsin results in a 70% loss in 
titer after two hours while only a 15% loirs in titer is 
observed for MK phage. A reduction in the amount of 
trypsin added to phage results in a reduction in the loss! 
15 of titer. However, at all trypsin concentrations inves- 
tigated , the MR*BJ?TX phag* are more sensitive to incuba- 
tion with trypsin than the mc phage. Thus, association 
of the BPTI-IXI fusion protein displayed on the surface 
of the MK-SPTX phage with trypsin blocks ite ability to 
20 mediate the infection of cells. 

The reduction in titer of phage m toy trypsin is an 
example of a phenomenon that is probably generals 
if present in sufficient quantity *, will 
proteins on the phage and reduce inf activity. 



JiFFXBITT SELECTION STSTSS 

Mfinlty Selection with immobilized tohydro -Trypsin 

MK-8PTI end MK phage were diluted to a concentration of 
1,4*10® particles per ml in TBS buffer { p asms 8 } 
containing 1,0 mg/ml BSA. We added 4 .0*l0 se phage to S p,l 
of a 50% slurry of either agarose- immobilised axihydro- 
trypsin beads (Pierce Chemical Co.) or agarose- 
i«dbiiised streptavidin beads (Sigma) in TBS/BSA. 
Following a 3 hour incubation at room teaiperature f the 
beads were pelleted toy centrifuge t ion for 3.0 seconds at 
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5000 rpm in a mlcrofugs and tfee supernatant fraction was 
collected. The beads were wasbed 5 times with TBS/Tween 
touf£«r and after each wash the beads were 

pelleted by cant rifugation and the supernatant was 
removed. Finally,- beads were resuspended in elution 
buffer (0.1 K HCi containing 2.0 mg/ml BSh adjusted to pH 
2.2 with glycine} and following a 5 minute incubation at 
room temperature, the beads were pelleted by centrifuge - 
tion. the supernatant was removed and neutralized by the 
addition of 2.0 M Tris-HCl buffer, pH 8.0. 

Aliquots of phage samples were applied to a Hytrah 
membrane using a Schleicher and Schuell fICeene, HE) 
filtration miaifold and phage Dm was immobilised onto 
the isytran by baking at 80*C for 2 hours. The baked 
filter was incubated at 42 «C for X hour in pre -wash 
solution (M&H1S2) and pre -hybridist ion solution (Sfcrime- 
3Prime, west Chester, S*K The 1.0 Kb msX 
i£30)/a&mX (base 2646) Dm fragment from MK HF was 
radioactively labelled with w P~dCTP using an oligoiaball- 
ing Mt (Pharmacia, Piseataway, HJ) . The radioactive 
probe was added to the Nytran filter in hybridisation 
solution (5Prime-3Prime> and, following overnight incuba- 
tion at 42 *c, the f ilter ma washed and autoradiographed. 

The efficiency of this affinity selection system can be 
semi -quantitatively determined using a dot -blot 
procedure. Exposure of MC-BPTX- phage -treated anbydro- 
trypsin beads to elution buffer releases bound MX-BPTf 
phage. Streptavi&in beads do not retain phage MK-BFTI. 
Snhydro- trypsin beads do not retain phage MBL In the 
experiment depicted in Table 125, we estimate that 20% of 
the total MK-BPTI phage were bound to 5 pi of the 
immobilized aahydro- trypsin and were subsequently 
recovered by washing toe beads with elution buffer CpH 
2.2 HCl/glycine) - X&der the same conditions,, no 
detectable MK-BPTI phage were bound and subsequently 
recovered from the streptavidin beads. The amount of MK- 
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BPTI. phage recovered in tZm alution fraction is 
proportional to the arrant of immob ;^r>U ar^ydro- trypsin 
added to the phage . No detectable MK phage were bound to 
either the ixsmohiiised anhydro- trypsin or strsptavidin 
5 beads and no phage were recovered with elution buffer. 
These data indicate that the affinity selection system 
described above can be utilised to select tor phage 
displaying a specific folded protein (in this case, 
BPTI) . Unfolded or iaeosnplefc© : BFFt domains are not 
10 expected to bind anhydro- trypsin. 

Affinity Selection with toti-BPTI antibodies 

MK » BPTI and KK phage were diluted to a concentration of 
1> iq» particles per ol in Tris buffered saline solution 
(SARM38) containing IvO mg/ml BSA. Two* 10 s phage were 
IB added to 2.5 MS of either hiotinylated rabbit smti-BFKS: 
XgG in XBS/BSA or biofcinylated rabbit anti -mouse antibody 
XgG {&ig»a} in TB8/BS&, and incubated overnight at 4*C, 
& S0% slurry of straptavidin- agarose {Sigstsa} f washed 
three times with TBS buffer prior to incubation with 30 
20 mg/mX BSA in TBS buffer for 60 minutes at room tempera- 
ture, was washed three times with TBS/Twaen buffer 
(WMM8B) and resuspended to a final concentration of S0% 
in this buffer. Samples containing phage and 
biof inylated IgG ware diluted with TBS/Tween prior to the 
25 addition of strept&vidin- agarose in TBS/Tween buffer. 
Following a 60 minute incubation at room temperature , 
streptavidin- agarose beads were pelleted by eentrifu- 
gation for 30 seconds and the supernatant fraction was 
collected. The beada were washed S times with TBS/Tween 
3.0 buffer and after each wash, the beada were pelleted by 
centrifugation and the supernatant was removed. Finally, 
the straptavidin-agarosa beads were resuspended in 
elution buffer {0.1 ST HC1 containing 1.0 mg/nd BSA 
adjusted to pH 2.2 with glycine), incubated 5 minute at 
35 room temperature, and pelleted by centrifugation. The 
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supernatant was removed and neutralised by the addition 
of 1.0 M. Tris-HCl buffer, pB 8.0, 

Miguota of phage samples were applied to a Nytran 
membrane using a Schleicker and Schuell minifold appar- 
S atus. Phage m& ma issadbiliased onto the Mytran .by 
baking at 80«C for 2 hours. Filters- ware washed for so 
etinutas in pre- wash solution (mmXBZ} at 4.2 *C then, 
incubated at 42*C for 60 aiinntss in Southern pr^~ hybri- 
disation solution (SPri»3Brime3 . The 1.0 Kb M£X 

10 ti630bp}/SBsl (2646 bp) DBA fragment from MK HP was 
radioes tiveiy labelled with 3 ¥~&'dCTP using an oligo- 
labelling kit {Pharas&cia, Piscataway, HTJ) . Hytran 
membranes were transferred from pre- hybridization 
solution to Southern hybridisation solution (SPrime — 

IS SPrltae} at 42 °C. The radioactive probe was added to the 
hybridisation solution and following overnight incubation 
at &£* the filter was washed 3 times with 2 « BSC, 0,1% 
SDS at room temperature and once at 65 *C in 2 x BSC, 0,1% 
SDS. Nytran membranes ware subjected to autoradiography. 

20 Urn efficiency of the affinity selection system can be 
semi -quantitatively determined using the above dot blot 
procedure. Comparison of dots &X and Bl or CI and Dl 
indicates that the majority of phage did not stick to the 
streptavidin- agarose heads. Washing with TSS/Twaen 

as buffer removes the majority of phage which are non~ 
-specifically associated with streptavidin beads. 
Exposure of the streptavidin beads to eXutioa buffer 
releases bound phage only in the case of MK-BPTX phage 
which have previously been incubated with toiotinyXated 

30 rabbit anti-BPTI XgG, This data indicates that the 
affinity selection system described above can be utilised 
to select for phage displaying a specific antigen tin 
this case 3FTI ) . we estimate an enrichment factor of at 
least 40 fold based on the calculation 

Perc-x, ga recovered 
Enrichment Factor - — — • — ~ ~™ 
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To increase the flexibility between BPTl and nature 
gVXXXp, we introduced colons for peptide extensions 
between these domains. 

XG The M13 gene III product contains < stalk- Xik«' regions 
as implied toy electron micrographic visualisation of the 
bacteriophage {L0PB85} . The predicted amino acid 
sequence of this protein contains repeating motifs , which 
include t 

3.5 giu,gly»gly.gly»ser (S<3GGS) seven times 

gly.gXy.gly.ser {SOSS} three times 

glu,gly,gXy.gIy-thr (EaaST) ones* 
The aim of this section was to insert, at the domain 

interface, multiple unit extensions which would mirror 
20 the repeating motifs observed in the ill gene product. 

Two synthetic oligonucleotides were synthesized. We 

picked the third base of these oodons so that translation 

of the oligonucleotide in the opposite direction would 

yield SEE* Mien annealed the synthetic oligonucleotides 
25 give the following unit duplex sequence Can BGGGS 

linker) i 

E G G C S 
S< C.S»G.GGA.GGA.GGA.TC 3' 
3« TC,CCT.CCT<CCT,AOS»C 5 1 

30 m {$} cs) csj m 

The duplex has a common two base pair 5 ! overhang {GO 
at either end of the linker which allows for both the 
ligation of multiple units and the ability to clone into 
the unique Harl recognition sequence present in OCV ! s 
35 and Oem MB42u This site is positioned within 1 

oodon of the tMk encoding the interface « The cloning of 
an EGGGS linker (or mai tip! a linker) into the vector ggxl 
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site destroys this recognition sequence, insertion of 
the BGGGS linker in reverse orientation leads to 
insertion, of SSSSL into the fusion protein. 
Addition of a single SGOGS linker at the m&Z ®it© of 
S the gene shown in Table 113 leads to the following gens; 

79 80 80a 80b 80C 80d SOe 81 82 83 84 
8 0 E 0 G 6 B A A S S 

. 0 OGT . <SGC . GAG . GGA . GGA . GQA , TCC . GCC .OCT - GAA ♦ QGT 



Hote that there is no preselection for the orientation 
of the linker Cs) inserted into the QCV end that multiple 
IS linkers of either orientation (with the predicted EGSGS 
or G&SSh amino acid sequence) or a mixture of 
orientations {inverted repeats of DM) could occur. 

A ladder of increasingly large multiple linkers was 
established by annealing and iigating the two starting 
20 oligonucleotides containing different proportions of 5' 
phosphorylated and nan "phosphorylated ends. The logic 
behind this is that ligation proceeds from the 3« unphos- 
phorylated end of an oligonucleotide to the S * phosphor- 
ylated end of another. The use of a mixture of phosphor- 
us ylated and non-phosphoryiated oligonucleotides allows f or 
an element of control over the extent of multiple linker 
formation, & ladder showing a range of insert aisses was 
readily detected by agarose gel electrophoresis spanning 
IS bp (1 unit duplex-5 amino acids) to greater than 600 
30 base pairs (40 iigated linkers -200 amino acids) * 

large inverted repeats can lead to genetic instability. 
Thus we chose to remove them, prior to ligation into the 
OCV, toy digesting the population of multiple linkers with 
the restriction ennymes ts&ixi or 2&sX, since the 
35 linkers f when Iigated *head" to-head' or s tail -to- tail' , 
nenerate these recognition sequences. Such a digestion 
significantly reduces the range in sisses of the multiple 
linkers to between l and 8 linker units between S 
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mid 40 amino acids in steps of 5), as i 
gel el setrophoresis . 

The linkers wars ligated (a® a pool of different insert 
sises or as gel -purified discrete fragments) into JfaxI 
cleaved OCVs MX3MB4S or GemM842 using standard methods , 
Following ligation the restriction enryme H&£X was added 
to remove the self -ligating starting OCV (since linker 
insertion destroys the Karl recognition sequence) . This 
mixture «as used to transform Xh-X blue cells and appro- 
priately plated for plaques <0CV M13MR48) or ampieillin 
resistant colonies <0CV <3ea«B42) . 

The transf ortsants were screened using dot blot 
analysis with one of two 3S P labeled oligonucleotide 
probes » One probe consisted of a sequence compl emen tary 
to the D» encoding the Pi loop of BFFX while the second 
had a sequence complementary to the DM encoding the 
domain interface region. Suitable linker candidates 
would probe positively with the first probe and 
negatively or poorly with the second. Plague purified 
clones were used to generate phage stocks for binding 
analyses and BFTX display while the Rf TM& derived from 



analysis and sequencing. Representative insert sequences 
of selected clones analysed are as follows; 

M13 , 3X4 miQ^Q'A, TCC . TCC . TCC . CT (C . GCC ) 

gly ser ser ser leu 

M13 , 3X7 {G G.GJ^.GG&.OSA.GS&.TCCC.Ga:} 

glh gly gly gly ser 

M1.3.3XX1 ICC, 
glu gly gly gly ser gly ser ser 

TCC . CTC - GGA. TCC. TCC . TCC ,CT(C. SCCC) 
ser leu gly ser ser ser leu 

These .highly flexible oligomeric linkers are believed to 
he useful in joining a binding domain to the major coat 
(gene ¥1X1} protein of filamentous phage to facilitate 
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the display of the binding domain on the phage surface, 
a@y smy also be useful in tfce construction of chimeric 
DSPs for other genetic packages as well. 

S Incorporation of interdomain extension fusion proteins 
into pha»a. 

A phage pool containing a variegated peutapaptide 
extension at the BWI:co&t protein interface was used to 
Infect SEF' cells, using the criteria of the previous 
10 section, we determined that extended fusion proteins were 
incorporated into phage, Gel electrophoresis of the 
generated phage, followed by silver staining or western 
analysis with anti-BPTX rabbit serum, demonstrated fusion 
proteins that migrated similarly to, but diseernafoly 
15 slower than, the starting fusion protein. 

With regard to the 'HOGGS linker' extensions of the 
domain interface, individual phage stocks predicted to 
contain one or more S -amino- acid unit extensions were 
analysed in a similar fashion. The migration of the 
20 extended fusion proteins were readily distinguishable 
from the parent fusion protein when viewed by western 
analysis or silver staining. Those clones analyzed in 
more detail included MX3.3X4 {which contains a single 
inverted EGQGS linker with a predicted amino acid 
2S . sequence of QBSSh) , M13.3X7 (which contains a correctly 
orientated linker with a predicted amino acid sequence of 
), (which contains 3 linkers with an 

and a predicted amino acid sequence for the 
of SGGGSGSSSLGSSSL) and MX3.3X& which contains 
30 an extension consisting of at least 5 linkers or 25 amino 
acids. 

<&e extended fusion proteins were ail incorporated into 
phage at high levels (on average 10 «s of copies per phage 
were present and when analyzed by gel electrophoresis 
35 migrated rates consistent with the predicted siae of the 
extension, Clones MIS. 3X4 and M2.3.3X7 migrated at a 
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position very similar to but discs rnabiy differ sat from 
the parent fusion protein, while M0L3.3X11 and MX3.3Xd 
W a r s m a r k e d 1 y larger. 

s 

mam& xv 

Peptide phage 

The following jaaterials and methods were used in the 
10 examples which follow. 

1. Peptide Phage 

HPQ.6, » putative disulf ids-bonded mini -protein, was 
displayed on M13 phage as an insert in the gene XXX 
protein (gXXXp) . MX3 has about five copies of gXXXp p»r 

IS virion. The phage were constructed by standard methods. 
HPQS includes the sequence CHPQFFRC characteristic of 
Devlin's streptevidin- binding E peptide , as well 

as a recognition site (see Table 820} . HPQS phage 

were shown to hind to ; straptavidin , 

20 m unrelated display phage with no affinity for 
streptavidin, MOT, was used as a control. 

2. sStreptavidin. 

Commercially available immobilised to agarose heads 
(Pierce) . Streptavidin CStrRv) immobilized to 6% beaded 

2S agarose at a concentration of 1 to 2 mg per ml gel, 
provided as a 50% slurry. Also available as free protein 
(Pierce) with a specific activity of 14 . 6 units per mg (l 
unit will bind X ag of hiotin) . h stock solution of 1 mg 
per ml in 3?BS containing 0.01% aside is made. 

30 3, P»M©tin. 

Commercially available (Boehringer Mannheim) in 
crystallised form, a stock solution of 4 xm is made. 

4* Streptavidin coating of microti tar well plates* 

Xsssulon (#2 or #4) strips or pistes are ussd. X0Q&L of 
35 Strav stock is added to each 250 jiL capacity well and 
incubated overnight at 4*C. The stock is removed and 
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replaced with 250 jih of BBS containing BS& at a 
eoneestration of i mg per xnh add left at 4«G lor a 
further 1 Sjoar. Erior to use i». a phage binding assay 
the wells are washed rapidly S times with 250 ph of PBS 
S containing 0.1% Tween. 
Binding Assays. 

Between xG and 20 ph of the Strte bead, slurry CS to 10 
^ bsad volwus) is washed 3 times with binding buffer 

10 {TBS containing BSA at a concentration of 1 mg per ssL) 
just prior to the binding assay. SO to 100 pL of binding 
buffer containing control or peptide -display phage ( 10« 
to 10 u total plaque forming units - pfu's) is added to 
each microtuhe. Binding is allowed to proceed for l hour 

IS at room temperature using an end over end rotator, The 
beads are briefly centrifuged and the supernatant 
removed. The beads are washed a further 5 hisses with 1 
mL of TBS containing 0>1% Tween, each wash consisting of 
a 5 min incubation and a brief eentrifugation. Finally 

20 the bound phage are eluted from the Str&v beads by a 10 
isiu incubation with pH 2 citrate buffer containing 1 mg 
per xal BSh which is subsequently neutralised with 260 ph 
of m tris pH 8. The number of phage present in each 
step is determined as plague forming unite ipfu's) 

23 following appropriate dilutions and plating in a lawn of 
F ; containing £. coll. 

To each SfcrAv- coated well is added 100 pit of binding 
buffer (PBS with l m psr mh BSA} containing a known 
30 quantity of phage (between 10 s ami 10" pft'e) . Incubation 
proceeds for 1 hr at room temperature followed by removal 
of the non-bouad phege and 10 rapid washes with BBS 0,1% 
Tweeru The bound phage are eluted with 250 ph of pH2 
citrate buffer containing i mg per mh BSA and 
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neutralization with SO of 1M tris pH 8. The number of 
phage present in each step is determined as plaque 
forming units {pfu's} following appropriate dilutions and 
plating is a ls*n of- F' containing JB, coli. 

Effect of iSSSSuMilfeol tXHFT) 
©a display phag® Madia? to gferepfcavidia~ agarose* 

a. Ose of HSU? -conjugated biotia and -atrept&vidia beads. 
IS Binding capacity of StrAv agarose beads for HSP- 

conjugated foiotin determined to be *« 1 pgr (equivalent to 
* ISO pmol biotia) per 5 aL beads (the amount used $a 

b. Effect of DTT on HEF- conjugated biotin binding to 
IS StrAv beads. 

S pit of Strike beads were incubated with 10 ng of HRP- 
biotin in binding buffer (TBS-BSA) in the presence of 
•varying amounts of DTT (at least 99% reduced) . Following 
a 15 minute incubation at room temperature , the beads 

20 were washed two times in binding buffer and an MF 
substrate added- Color development was allowed to 
proceed and noted in a semi-quantitative manner, Table 
827 ahowe that the binding of biofinylated horseradish 
peroxidase CHRP) is not greatly affected by 

2S concentrations of DTT below 20 tM, ,h\», DTT 
concentrations of 20 and so mM also inhibited the 
interaction of HHP arid substrate In the absence of Str&v 
beads hence having a general negative effect in this 
system. 
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10 s pfus of HPQS were added to binding buffer CTBS-BSh) 
in the presence of different concentrations of DTT» 
Incubated at room temperature for i Hour then diluted and 
5 plated to determine titer as pfus. Table 828 show the 
effect of DTT on the inf activity of phage H»Q6. Hence, 
either 33$T has no effect on phage iafecfcivities over this 
range of concentrations or the affects are reversed on 
dilution of the phage » from these control escperiments it 

10 is apparent that XSTr" can he used at concentrations below 
10 in studies on the effect of reducing agents on 
peptide display phage binding to StrAv, 

Table 829 shows the effect of BTT on the binding of 
phage HFQS and mm to StrAv beads. The most significant 

IS effect of DTT on HFQ6 binding to StrAv occurred between 
0,1 and 1,0 %M STT, a concentration at which no negative 
effects were, observed in the preliminary control 
experiments. These results strongly indicate that, in 
the case of HPQ6 display phage, DTT has a marked effect 

20 on binding to StrAv and that the presence of a disulfide 
bridge within the displayed peptide is a requirement for 
good binding. 

S^&B&SJg OF ST^FT&YXBSIS-SOmm DXSKMKJT PBA<§! 
25 BY FhCTOSl Xa ChmvMSE 

Phage HPQ6 contains a bovine F.^ s recognition site 
> £H/XV} , Xn mmy instances, 1BGR is sufficient 
recognition site, for but we have extended the site 

in each direction to facilitate efficient cleavage. The 
30 effect of preincubating HPQ6 phage with P.X* on binding to 
StrAv heads is shows in Table S32, Thus while this 
concentration of F«X» {2.5 units) had no measurable effect 
on the titer of the treated display phage it had a vary 
marked effect on. the ability of the treated display phage 
35 to bind to StrAv. This is consistent with the StrAv 
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recognition sequence being removed by the action of FX» 
recognising and cleaving the YiEGR/lv sequence. 

Table 833 shows the effect of FX, treatment of HPQ6 
following binding to Sfcr&v* Is it possible to remove 
S display phage bound to their target by the um of FX» in. 
place of pH or chaotropic agent elusion? HPQ6 display 
phage were allowed to bind to StrAv then incubated either 
is FX* buffer or the same buffer together with 2,5 wnit© 
of FX S for 3 hrs. The amount feinted was compared to the 

10 total number of phage bound as judged by a pH2 elution. 
Therefore , while the display phage are slowly removed in 
the buffer alone, the presence of FX, significantly 
increases this rate< 
The removal of K£$6 display phage from StrAv toy FX* was 

15 also studied as a function of the amount of ansysne added 
and th© time of incubation, as shown in Table 834. ff.B, 
at greater concentrations of the enssyme (1.2 U for 1 hour 
or 2.5 it for 2 hours) , a loss in inf activity of the 
treated phage was noted as measured by pf us. 
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Table 10; Abundances cfcfcalead 
from various 

A. Optimized fxS Codon, Restrained by [D] + ES] « [KJ + [RJ 

5 





T C A 


G 




1 1 


.18 J2$ 


,30 


f 


2 1 


,22 -16 .40 


.22 


X 


3 1 


.5 ,0 .0 


.£ 


s 


Amino 




Amino 




■A 


4,80% 


c 


2,86% 


$} 


6,00% 


E 


6.00% 




2,86% 


0 


6,60% 




3.60% 


X 


2,86% 




5,20% 


L 


6.82% 




2.86% 


8 


5,20% 




2.88% 


a 


3.60% 




6*82% 


s 






4,16% 


V 


6,60% 




2,,.S.61-Ife§: 


Y 


5,20% 
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IB] 


4 [ej * m * mi - 


.12 




rat; 


to » iu»m{w)/»iiCS} 


« 0.4074 




i 




{ ratio M 


stop-f rg 


t 


2.454 


.4S74 


.9480 


a 


6.025 


,1660 


.8387 




14 . 78 3 




.8520 


4 


36.298 


.0276 


.3077 


s 


89 , OSS 


.sua 


,7657 


S 


218,7 


4*57,10* 


,7258 


7 


536. § 


1.86-XO- 3 


.6881 
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S B. T3fere»trained, optistiJseS 



,2? 



.27 
.21 



1.0 


3 1 .5 .0 -0 








A *. est 


C 


2.84% 


15 


D 5,81% 


B 


5.81% 




F 2 . 84% 


* 


S..&* 




H 4.08% 


X 


2.04% 




& S , 81% 


X, 


€.83% 




M 2.84% 




3.81% 


.20 


g 2.8S% 


Q 


4.08% 




R 6.83% 


s 






T 4.05% 


V 


s.r/% 






Y 


5.41% 



wo nnsm pcrmsnmmv 



101 



s 



01 * 


[BJ 0,1162 


[it] + mr * o. 


1264 


ratio 


« Ata<W)/abun{S> » 0.41176 




s 








1 


2.4286 


.43.176 


. 9419 


2 


$.8981 


,16955 


,8872 


3 


X4*3£4X 


,08981 


,8356 


4 


34.7875 


. 02875 


,7171 


5 


84,4849 


,011836 ' 


,74135 


6 


205.180 


,004874 


.69828 . 


7 


498,3 


2- 007-10'* 


,6577 
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Table 10 x mmmnQm cfctaiaed 
from mrisms vgCodon 
(conti aued 

S c. Optimised BEST 



20 





T C 


A 


<5 


1 I 


,2071 .2929 


.2071 


.2929 


2 1 


,2329 ,2071. 


.2329 


,2071 


3 j 


X, ~0 .0 


;o 




Amino 

JBUfejWC 




ftlBWL 




A 


6.06% 






B: 


8 , 58% 


* 




F 








H 


ft. 59% 


i 


6,0S% 






& 


8.58% ,. 


M 


none 




€.0$% 




6,G6% 






R 






S-S8.l;...S3fea 


% 




¥ 




W 




Y 





25 gltOTL 
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4,0 .25 . 1- 

8-0 ,125 1. 

1S-0 .8€25 1. 

32.0 .03125 1. 

64,0 .015525 1. 

128.0 ,0078125 1. 



vgCodon 



B. Optimised HNS 

&— S 

1 j ,23 -21 .23 33 
10 2 j .215 .235 .285 .2X5 

3 | .0 ,0 .0 1.0 



Mdri© Amino 



20 







.acid 




A 


9,40% 


G 




■D 


ucn& 


B 




F 




Q 


7.10% 


H 


none 


X 




& 


6.S0% 


h 


_J^SM .Man 


M 


4.90% 


H 




P 


s.oot 


Q 


6.00% 


E 


9.50% 


S 




T 


6.6 % 


¥ 


7,10% 


1 




Y 
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1 . .9388 




2 


3.7SS8 


.26604 


0,8723 


3 


7.2876 


.13722 


0,3148 


4 


14.1289 


.07670 


0,7810 


S 




3.65- 10' s 


e . 7io8 




S3. IDS 




Q.6S33 


7 




9, 72 «1Q* 


0.6200 
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Tafele 10: Aotmdsmcas obtained 
from optimum vgCodon 

s 

E» Unoptimized « {ftgK gives identical distribution) 



fi &■ &~ 

10 X | .25 ,2S .25 ,25 

3 J .23 .25 .25 .25 
3 | .0 .5 .0 0>5 



amino ^sto 



as 



acid ... 


^^mdangg— - 


ac$M 




A 


6.25% 


C 


3.125% 


P 


3vX2S% 


H 


3.125% 




3.125% 


a 






3.125% 


i 


3.12S% 




3.1,25% 


L 


3.373% 




3.125% 


n 


3.125% 




6-25% 


a 


3.125% 




9.375% 


s 


9.375% 


T 




¥ 




W 


3.125% 


Y 


3.123% 
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1 3.0 .33333 .96875 

2 9.0 ,11111 .9385 

3 27,0 .03704 .30315 

4 81.0 ,01234567 .8807 

5 243.0 .004il|2 .3332 
€ 729.0 1.37 > 10-* ,82655 
7 2187.0 4.37*10^ .8007 
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Table 102b ; Annotated S@gueac& of gene 
after insertion cf MI linker 



5 nucleotide 

masher 

S ! - (GG&.TCC TOTGA <3TC) OGC- 3 
tram $®m polylis&er 

10 

»3S MBS "2-0 

IS 

lacO-syrxaa operator 

20 aaactc MISS CttaCT- 77 

Sasl Shine-Dalgarno seg. 



25 



f£M ( I [ K i S | & f V I L f * I & I ■# I 
I 1 | 2 f 3 I 4 I $ ] S I 7 | 8 J 9 UOj 
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l.MX...22,\ tibm 1 1 



j y j A j v I A ! T | I* | v I P | M I L ! 
S I 11| 12 1 13] 14 I IS | 16 1 X?J 18 1 19 | 30 j 

| GTT | OCT | GTC | GCG | ACC j CTG j CSTA j CCT j ATS j TOG j - 13? 

! feu ij iJtekXL 



10 !Sl?U!S|PiDiF!C!L!E| 
! 21 1 22| 23| 24| 25 f 26 f 27 j 28 1 29 1 30 j 
! TCC | TO j GCT S <m | CdHOftT j TTC | TQf j CTC {OftO}- *«? 

* Jtemi .ijmjLX 

M13/BPTX Jact XJ®3£ul I 

15 



20 



\ P | S? | X | T j <3 ! P j C | K | A I E | 
. | 31f 32 1 33 | 34 f 35 j 36 1 37 f 38 1 39 1 40 f 

usiij i n 

j 



j Bra 



momtsm 



no 

Sable 102b * imitated Sequence 
gene after iimertion elMI lister 
(continued) 



j X | X ) E I ? | W | * I 1ST { .* J IE | A J 
j 41 1 43| 43 f **.| 4S1 4S| 4?j 48 1 49 | SO j 
j M?C | &TC | OSC \ TAT \ TTC | TAC | &AT | GCT \ MA f OC | - 226 



i « I x 1 c I o I ,t i ' * I V ] ' * | a } 5 j 

j SXf 52 j S3 | 54) 3S\ 5fi| 57 j 5S | S$j €0 j 

.1 stu ij .L.&co. I t 

as i-Jtei & I 



j C J; & f H I X I E j M \ ■ «T I F I K . | 
I 61| €2f S3] 64f SS^ 6CT.1 S7j «£| 69 1 

1 BgRjL-X 

| S | A | H f 33 1 C j K I * | T j C 1 0 1 
1 70} 731 1 72f **! **1 7Sj 771 *?§ I 

2s j tcgM ace j »| sm? lise |M| cot] &ec I *ec |« j - 3 m 
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Sm/Mi3 boundary 
,1 

1 £3 | A 1 A i B J G I D j D \ P j A j X J A ) A | 

| 90j 81 | 82 | S3 | 84 { SS ] 86 j «7j 88} 89 1 00 j 91 | 

s ! sac | Gee i gct ! a» | sot \ cw j s&t | ccg 1 sec \ Mm j sea j see |> 350 
4JaaJUl .1 £U I 



10 1 * j H j M I . .t = I 0 I A \ B f A j T j 

j 93 1 S3 I 94 j SSj 96 f 97 \ 98 j 99 j 100 j 
I TO j ft&? | TCT | CTS | CM | GOT j TCT j OCT j ACC | - 377 
fHind 31 



j E J T | X I G j 1 j A | W j 
1 101 1 102 j 103 \ 104 1 10S 1 106 j 10? { 

| GAG | TAT ! ATT j OCT { TAG \ GCG f TGG j - 398 



20 | A | M | ¥ | V | f | 1 | V I © j A | 

1 108 j 109 1 110 | lllj 112 1 113 1 114 1 IIS j IIS | 
| SCC | ATG | GTS f GTS | GTT |ATC I GTT | GOT | GCT | - 425 
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Table iG2bt Annotated Sequence 
af ter insertion of SaU linker 



j T | I | Q I I | 
\ 117 } 118 iilS (120 | 
|ACCfATCjGGGfATC{~ 



1 121 j 122 \ 123 1 124 j 12 S 1 126 f 127 j 128 j 129 j 130 j 

| AAA I CT£r I TTC | &&G \ A&G | TTT f ACT | TCG f AM | GO® | - 467 



IS 

I s ! > | , ] » ( 

| 131 | 132j 133 | 134j 

•|TCr|tAA] jP3A|TAG[ MS- 48S 

2D 

S21 
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Note the following essyss© equivalences, 



Wm HI -MI 
am xi - is&QiSi x 



ACQ III 

to ii 
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J 

rt 

I 



^ rt 

H «"* 

^ -v *ci? ?- ! ffl ♦sis ♦ 

^ «, w g$ as fsg 

?* I & • ml 

I I I I I m & m o 

t( H ^ fli tf » i ft W * * 

fl! £ J? € £ ^ < e ^ « 4 

4 «S 0 # s$ <K ?s « i$ Rt ■ * 

41- «" i«: > «S « SS S8 » ® 

> 8 ft 0< ■ -Hi >« N >~s H H 

pt! OS H «J *i «M «M g g ^ 

.u t$ v «a ft ffi' ft ft ft 

<w . > >w M > > ** ** ^ 

g rt «S > <~S > > H H r~* 

rt x*; r-S «$ 01 rt rt a a W 

p, AJ «f a* ■« : > ;J3S ft # «J «S 

rt -u ft m <3 sf r? > ■ > > 

rt rt > ft > m ® m *4 a « 

rt « > > > <~* ® » W 

rt -m &i ft b m ^ *s *3 h 

w m m -ta m tA & & 

W O & <& w 

3SS 5? g H *a M «! 

sh a s» &« ft Ni M 

mi s h a* g £ 

: ; S J, 

M H H H M 

1 I I alii i 1 1 1 
s 
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rabl C 107 I ^ u , ar r; 3 $ 

analysis of veete>r~enc©ded 
signal : ; BPT! t : mature VIII protein species 

s _ _ 3i m mm&m* J^JsC.^slssL 

No DM {control) 



pOEKf-3Zf (-J 
pGEM-MBl€ 




Notes; 

IB a,} pre~beta~ lactamase, encoded by tne ssm gene* 

fe.) pre-BPTX/Ylll peptides encoded by the synthetic 

gen® and derived constructs. 

c.) » for absence of product i ♦ for presence of 

product; for Not Determined. 
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Table 108; Western. analysis* of ia xisa 
e^ressed 

signal: iBPTI: : mature ¥1X1 protein species 



5 A) expression, in strain XLX-Blue 

pSBM-MSlS ¥1X1 
10 pGSM-KB26 VIXX +++ */- 



in strain SBF' 

phOA 4/~ *+* 



Potest 

a} Analysis using rabbit aati-BFTX polyclonal antibodies 
and horse-radish-peroxidase- conjugated goat anti -rabbit IgG 
20 antibody. 

b) pro-BPTX/vXIX peptides encoded toy the synthetic gene 
and deriired constructs, 

c) processed BFTl/vXiX peptide encoded by the synthetic 
gene. 

25 d) not present ,.„«...*. 

weakly present . , , , ♦ +/ - 
present v.„,. < + 

strong presence 

very stx'ong presence 
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Table ic i em HI 
1579 5 ' -OT fittMffim TTATTCGCRA TTCCTTtAGT 

1611 TGTTCCTTTC TATTCTCACT < 

XSSI TGTTTAGCAA AACCCCATAC ; 
5 XSSI 
1731 
1771 

1811 TTGGGCTTGC TATCCCTGAA , 

XSSI SGGTQGOGGT 1XTIGAGGGTG GOSGTTCTGA GGOTSGCSQGT 

10 18.91 ACTAA&CCTC OTAGTACGG TGATACACCT ATTCCGGGCT 

1931 ATACTTATAT C&ACCCTCTC GACGGCACTT ATCCSCCTGG 

1971 TACTGAGCAA A&CCGCGCTA ATCCTMTCC TTCTCTTSSS 

2011 GAQTCTCAGC CTCTTAATAC TTTCATGTTT CMMfMl'A 

2051 GGTTCCGAAA TAGGCAGGGG SCRTTAaCTS TTTATACSGG 

IS 2091 CACTQTTACT C&mGG&Cm ACCCCGTTAA AUGTTATTAC 

2131 CAGTACAGTC CTGTATCATC AAMGCC&ro TATOACSCTT 

2171 hmmmsm taa&toaga gactgcgctt tccattctgo 

2211 CTTTAATGAG GAXXX&TTCG TTTOTOftATA TC1UK&SCCIVA 

2251 TCGTCTGACC TGCCTCAACC TCCT8TC&AT GCT^SCGGCG 

20 2291 CfCTCTGOTGQ TGGTTCTGGT GGCGGGTCTG AGGGTGGTGG 

2331 CTCTG&GGGT GGCGGTTCTG 3M3GQTQGOQG CTCTGftGGGA 

2371 GGCGGTTCCG GTGGTGGCTC TGGTTCCGGT GATTTT3ATT 

24X1 ATOMAAGAT GGCA&ACGCT AAXAAGGGGG CTATCACCGA 

245 1 ARATGCCGAT GAMAGGCGC TACAGTCTGA CGCTAAAGGC 

25 2491 AmCTTGATT CTSTOSCTAC TGATTACSSOT GCIGCTATCG 

2531 ATGGTTTCAT TGGTGACGTT TCCGGCCTTG CT&ATGGTAA 

2571 TQSTSCTACT GGTGATTTTG CTfSGCTCTSA TTCCCAAATG 

2€11 GCTCA&GTCG GTGACGGTGA TAATTCACCT TTMTOMTA 

2651 ATTTCCGTCA ATATTTACCT TCCCTCCCTC AATCQGTT8A 

30 2691 ATGTCGCCCT TrrGTCXTXA GCGCTSGTAA ACCATATGAA 

2731 TTTTCTATTG ATTGT8&CAA AATSAACTTA TTCCGTGGTG 

2771 TCTTTG TGCCACCT TTATGTATGT 

28X1 ATTTTCTACG TTTGGTAACA TACTSCGTAA TAAGGAGTCT 

2351 TAATCATGCC AGTTOTTTG JGTATTCCGT 
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Table no : introduction of ®a£X into gone XXI 
A) Wild- type IS, portion encoding the signal peptide 



M K & L L F A 



1 2 3 4 5 S 7 
1579 S ! -GTG A&A m TT& TTC SCA ATT CCT TTA 

/ Cleavage site 

¥ v" P F Y 8 H S ^ A B T V 



B) XXX, portion encoding the signal peptide with m& site 

20 » fc * 1 1 f a I p 1 

1 2 3 4 S £ 7 8 9 10 
1ST? S ! ~gtg saa aaa tta tta ttc gca att cct tta 

2$ / cleavage site 

v v p f y s G A v a e t v 
11 12 13 14 IS IS 17 18 19 SO 21 22 
1609 crtt gtt cct ttc tat tct <3Sc Gee get gaa act gt,.^ f 



PCTAJS92/0I539 



Table 113 ; Asm< itecl S . j«j e 
PGEM-MB42 CCWJM a< 3 U3GA iTM A 



-10 



IseO-syrnm operator 



till 

atgIaaaIc 



0 j S I T j 
3 4 5 
\ &m | AAA | caa j AGC I ACG I - 
phoA sig*^ 



X j h \ & j L I I, U I F S T I P I V I T I 

6 7 I 8 9 } 10 11 j 12 | 13 1 14 15 16 j X7| 
:XC | GC?i I CMC \ 'ITT A j CCG TTA i CTG- { TTT i ACC j CCT i GTG j ACA I - 



pboA signal 



phoA signal - > A j &SS&U. 

PhoA/BFT.1 jftct 



C | I» E 
28 j 28 30 
GAT | TTC | TOT 1 CTC j GAG j 



| Pi f [ Y J T I S j P 1 C I K [1 ! I j 
' 37 j 38 | 39 j 40 j 



WO 



1.0 



25 



Table 113 

Ptacx TC8S KSQAQQ^ikTMA} i i 



C MR 
! SI 82 



\ X 


& 


Y 1 


42 


43 


44 


\M!C 


ccc 


TAT ) 


h 


c 


Q \ 


52 


S3 


54 




TGC 





45 | 4S J 47 48 49 SO 



A | It 
63 64 



X™1 



| F 


1? 


Y 


G 1 


| 50 


57 


58 


59 I 


i TTT 




TaC 


SOT: 




■■■■Ifgg 


nr 




1 N 


M 




K j 


| <SS 


67 


68 


69 




atc 


TTT 





|s|Ajs|r>|ciM|R}T|cje 

I 70 71 j 72 1 73 j 74| 75 76 1 77 j 78 j 79 1 
I TOG GCC § GM T x . ' : - s ' /■ V > I CGT | 
' • ^ .C.Tc::..,L. 

insert- 



G A 
80 81 

M 3 



BPTX/MI3 boundary 

v! A j E | G | D | t> j P j A 
82 j 83 1 84 85 8 

:x?jG€C 



j E£0: 



90 1 91 
aag|gcg|gcc 



mature gene ¥111 coat protein 



F 


to 


fi 


h 


Q 


% 




32 


S3 


34 


35 








TTC 


A&T 


TCT 


cm 


CAT 


GCT 




E 


Y 


1 


G 


Y 


A. 


w j 


101 


102 


103 


104 


1.0 S 


106 


107S 


TAG 


TAT 


ATT 


GGT 


TAC 


GCC 


TGC i 



j T 1 
|100| 



WO 92/156"9 
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Table 113 c Annotated Sequence of 
Ptac t - L ? AA&> ; : 

(continued) 



U I M I V | ¥ j ? j I { ¥ | G U j 
108 | IDS 110 ^ 111 | • 4 115 1116 | 




30 AGTCT/* • , 3CT TTTTTTTT* 

— Jtaaatotpr 



TS5T ! 



wo misffl 



PCT/US92/01539 
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Table 124: mu - cio* ! ; sge Titer Usi»s 
Agarose - immobilized Anhydro-Trypsiix 

Percent Residual Titer 

As a Function of Time {hours} 







I^^^M&jt; ioffl , , „ 










MJC-BPTl 


5 #i its 




104 


105 






2 Ml IAT 


82 


71 


St 






S jsi » 




40 


27 






10 |i l IAT 


40 


30 


24 






S Ml IS 




96 


9:8 






2 jtl IAT 


-97 


103 








S j& IAT 


lid 


111 


m 


IS 




10 01 IAT 




S3 


106 



IAT - ismateilized anjjydro- trypsin 



wommm 



FCI7US92/91539 



?abl« us i Affinity Selactioa of H&-BFTX Shage 
on XtwoMXissea mhy&m~ f Pr$p&ln 

Percent, of Total Phage 
f ^ Edition B»S5vm 4 ffiJ*aU*MttU««»- 
5 i&l IB <<1* 

2 «a w s 

10 Ml XAT BO 

5 £1 IS <<X* 
2 M l XAT 

5 Ml ™ «1 

10 Ml I»* « <X 



20 



XS « xaraobiliMSSI *t*6pt»viai» 
XAT - X»l3ili»0d anhydro-tr^sin 
» not detectable. 



WO 92/15679 
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Table liSs translation of Et&ml^LXX; ife^is :5MuaBblZI 

% a 3 4 3 ■« .7 8 9 IS 13- 12 X3 14 IS 
£M K . K L X* F A I P L V V * F Y 
|TG AAA ASA OTA^TTA^C^G^ATT^CCT W& GTT ^f^f;^'^ 

16 1? 18 19 20 21 22 23 24 25 26 2? 28 23 30 
S G A. & P » F C h S 3? P ? * G 
TCT GGC SCC |<?t ^^ g ^ ^j gr^ ???: ??r! ?^?^??^ 

31 32 33 34 35 36 37 38 33 40 41 42 43 44 45 
P C 2C A R I X & T F T N A. K & 

jgggJfegs^^....g:gg. ..sasLatc ate- <?«u^ fc&LJJ: - ^ „^£&~&&&~g£§: 

46 4? 48 49 SO 51 52 53 54 55 53 5? 58 59 60 
O L C 0 T F V Y S © C ^ A K R 

€1 62 63 64 65 66 67 68 69 70 71 72 73 74 75 
» N F X S A 8 D C M R T C S G 

76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 

A G A A E V V E S C L A X P H 
: ' r GAA ACT QTT AGT TGT TO GCA A*A ^ 

1«- mature gen® III protein — ------ 

91 92 93 94 95 98 97 93 99 100 101 102 103 104 105 
T 8 H S F T » V W K 0 © K * JL 
ACA GAA AAT TCA TFT ACT AAC GTC TGG AAA GAC GAC AAA ACT TTA 



12 S 

Table 11 Si translation of ;ffla&ttCBL^XII 



206 

D 
GAT 


107 

R 
COT 


108 
Y 

ac 


3.09 

A 
OCT 


110 

m 

AAC 


111 

Y 
TAT 


112 

E 
GAG 


113 

G 
GGT 


114 
O 
TGT 


115 

h 
CTG 


11 6 

W 
TGQ 


11? 

H 
AAT 


118 

A 
GCT 


119 
T 
ACT. 


120 

G 
GGC 


121 

V 
GTT 


122 123 

V V 
GT& GTT 


124 

C 
TGT 


12 S 

T 
ACT 


126 

Q 
GGT 


127 

X> 
GAC 


126 

GAA 


129 

T 
ACT 


130 

Q 
CAG 


131 

C 
TGT 


132 

If 
TAC 


133 

G 
GGT 


134 

T 
ACA 


13 5 

W 
TGG 


136' 
GTT 


137 

P 
CCT 


138 
ATT 


139 

G 
GGG 


140 
CTT 


141 142 

A I 
GCT ATC 


143 

P 
CCT 


144 
& 
GAA 


14S 

N 
AAT 


146" 

S 
GAG 


147 

G 
GGT 


148 

G 
GGT 


149 
G 
GGC 


ISO 

s 

tct 


151 
GAG 


152 
C 
GGT 


153 

G 
GGC 


154 
CCT 


155 

8 
TCT 


15 S 

vB 
GAG 


157 
G 

ggt 


158 

G 
GGC 


1S9 

G 
GGT 


160 

23 
TCT 


161 
GAG 


162 

G 
GGT 


163 

G 
GGC 


164 

G 
GGT 


1SS 

T 
ACT 


166 
K 

aa& 


167 

P 
CCT 


168 

P 
CCT 


169 

E 
GAG 


170 

Y 
TAG 


171 
GGT 


172 

D 
GAT 


173 

T 
ACA 


174 

jp: 

ccr 


175 

1 
ATT 


176 

P 
CCS 


177 
G 
GGC 


178 

Y 
TAT 


179 

T 
ACT 


180 

Y 
TAT 


181 

X 
ATC 


182 

SSE 
MC 


183 

P 
CCT 


184 

t» 

CTC 


1SS 

D 
GAC 


186 187 

G T 
GGC ACT 


138 

Y 
TAT 


189 

P 
CCG 


190 191 192 

P G T 
CCT GGT ACT 


193 

E 
GAG 


194 195 

Q » 
CAA AAC 


196 
P 

ccc 


197 

A 
GCT 


198 
AAT 


IS 9 
P 
CCT 


200 

N 
AAT 


201 
CCT 


202 
S 

Ter 


203 

L 
CTT 


204 

E 
GAG 


205 

E 
GAG 


206 
TCT 


207 
Q 
CAG 


208 

P 
CCT 


209 

Is 


210 
SAT 


211 

T 
ACT 


212 213 
F H 
TTG ATG 


214 

F 
TTT 


2 IS 

Q 
CAG 


216 

K 
AAT 


217 

N 
AAT 


218 
R 


219 
F 
TTC 


220 
CCA 


221 

® 
AAT 


222 

R 
AGG 


223 
Q 
CAG 


224 

G 
GGG 


225 
A 
CCA 



PCT/US92/0J539 
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Table US i translation of gl ^al»XXI ? sfeg.tl, ? s»&S3^I£X 
{continued} 



226 

L 
TTA 


227 

T 
ACT 


22 S 
GTT 


229 

Y 
TAT 


230 

T 
ACS 


231 

a 

GGC 


232 

T 
ACT 


233 

V 
TTT 


234 

T 
ACT 


23S 

Q 
CAA 


23e 
s 

GGC 


23? 

>T 
ACT 


238 

D 
GAC 


239 

P 
CCC 


2 40 
GTT 


241 
K 

aaa 


242 

T 
ACT 


243 
T 
TAT 


244 

T 

mc 


245 

6 

CGG 


246 

Y 
TAC 


24? 

T 
ACT 


248 

P 
CCT 


249 

if 

GTA 


259 

S 
TCA 


251 

S 
TCA 


252 

K 
AAA 


2 S3 
A 
GCC 


254 

M 
ATG 


255 

Y 
TAT 


25 S 
D 


25? 

A. 
GCT 


255 

Y 
TAC 


259 

W 
TGG 


2eo 
* 

AAC 


261 

G 
GGT 


262 

k 

AAA 


2 S3 
F 
TTC 


264 
& 
AGA 


265 

D 
GAC 


258 

C 
TGC 


26? 

A 
GCT 


26S 

F 
TTC 


269 
H 
CAT 


270 

B 
TCT 


271 

0 
GGC 


272 

f 
TTT 


273 
AAT 


274 

s 

GAG 


275 

D 
GAT 


276 

V 
CCA 


27? 

F 
TTC 


278 
V 
GTT 


279 
C 
TGT 


280 201 
GAA TAT 


282 

Q 

CA& 


283 

G 
GGC 


284 

Q 
CAA 


285 

S 
TCG 


286 

8 
TCT 


28? 
GAC 


288 

h 

cm 


280 

I? 
CCT 


290 

0 
CM 


291 

P 
CCT 


252 

CCT 


293 
V 
GTC 


294 

N 
AAT 


256 

A 
GCT 


296 

G 
GGC 


297 

G 
GGC 


298 
G 
GGC 


239 

S 
TCT 


300 

G 
GCT 


301 

G 
GGT 


302 

G 
GGT 


303 

s 

TCT 


304 

G 
GOT 


SOS 

G 
GGC 


306 
G 
GGC 


307 
TCT 


308 

E 
GAG 


309 
G 
GGT 


310 

Q 
GGT 


311 

G 
GGC 


312 

B 
TCT 


313 

8 
GAG 


314 

G 
GGT 


315 

G 
GGC 


316 

G 
GGT 


317 
TCT 


318 

i 

GAG 


319 
G 
GGT 


320 

G 
GGC 


321 

G 
GGC 


322 

S 
TCT 


323 
£ 


324 

G 


325 

a 

GGC 


326 

G 
GGT 


327 

S 
TCC 


328 

G 
GGT 


323 

G 
GGT 


330 

G 
GGC 


331 

8 
TCT 


332 

G 
GGT 


333 

& 
TCC 


334 
G 


335 
D 


336 

F 
TTT 


337 

D 
GAT 


338 

X 
TAT 


339 

23 
GAA 


340 
K 

aag 


341 

M 
ATG 


342 
A 
GCA 


343 
N 

mc 


344 

A 
GCT 


345 
AAT 



wo nmm 
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Table 116: translation of Si3aal.r,IXI ' jfeB&t* igaa^El 



346 

K 
AAG 


34? 


348 
A 
OCT 


349 

M 
ctg 


350 

f 
ACC 


3S1 

S 
GAA 


352 

H 
AAT 


353 354 355 

A B B 
GCC GAT GAA 


356 357 

N A 
AAC GCG 


3S3 
1* 

cm 


3SS 

Q 
CAG 


380 

S 
TCT 


361 
GAC 


OCT 


3 S3 
AAA 


354 
G^ 


365 
K 


366 
CTT 


3€T 

S3 
GAT 


368 388 
S V 
TCT GTC 


370 

A 
GCT 


371 
T 
ACT 


372 
B 
GAT 


373 
T 
TAG 


374 

d 

GGT 


375 

A 
GCT 


376 

& 

GOT 


377 

I 
ATC 


378 
GAT 


378 
G 
GGT 


380 
TTC 


381 

X 
ATT 


382 

0 
GGT 


383 
GAG 


3$4 

V 
GTT 


385 

S 
TCC 


386 

G 
GGC 


387 
CTT 


38S 
A 
OCT 


389 
AAT 


390 

O 
GGT 


391 


332 393 394 

G & T 
GGT OCT ACT 


395 

G 
GGT 


396 

B 
GAT 


39? 

F 
TTT 


398 

A 
GCT 


399 
G ' 
GGC 


400 

s 

TCT 


401 402 
AAT TCC 


403 
Q 
CAA 


404 405 

N A 
ATG OCT 


CM 


407 
GTC 


408 

G 
GGT 


409 
GAC 


410 

G 
GGT 


411 
GAT 


4X2 
K 
AM 1 


413 

$ 
TCA 


414 

F 
CCT 


415 

h 
TTA 


416 

M 
ATS 


417 

n 

AAT 


418 
AAT 


419 

F 
TTC 


420 

R 
COT 


421 422 

0 ? 
cm TAT 


423 
L 

tcc 


424 
CCT 


425 

8 
TCC 


426 427 

Is P 

CTC CCT 


428 
Q 
CM 


429 
TCG 


430 

V 
GTT 


431 

S 
GAA 


432 

C 
TOT 


433 
E 
CGC 


434 

P 
CCT 


435 

F 
TTT 


436 
GTC 


437 

F 
TTT 


438 

S 
AGO 


439 440 

A G 
GCT GGT 


441 
AAA 


442 443 444 

PTE 
CCA TAT GAA 


445 

F 
IT" 


446 

S 
TCT 


447 

X 
ATT 


448 

D 
GAT 


449 

C 
TGT 


450 

D 
GAG 



WO §2/15679 
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Tafele 34.6 s translation of gAgnfrt - III ? slaB&l? s^&a&fc^XXX 

4.«5X 4§2 453 454 455 *S6 4S7 458 .45* 460 46X 462 4 S3 464 465 

K X W X, F * © V F A F L L Y ¥ 

jRM ATA AAC TTA 1" : ^ ^lffig% 

|< u iicr i 

466 48? 468 469 470 4*?X 472 473 474 475 476 477 478 475 480 

(3CC ACS TTT ATO TAT .G T A JTrf TCT^j ■^mSLMSK^ CGT 

481 482 483 484 485 

H K B S 
AAT AAG GAG TCT TAA 



WO 93/15679 



FO7US9.2/01S39 
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Table 3.16; translation oi 4» - ill-- s fc£ \i ' <*\>x§ Hi 
{continued} 



Second 





t 


c 


a 






t 


IS 


2X 


IS 


I 


t 




12 


5 


10 


6 


c 




10 


4 


0 


0 


% 




0 


3 


0 


4 


9 


c 


6 


ao 


2 


8 


t 




: a 


4 


0 


3 


C: 






4 


$ 


1 


a 




4 


3 


7 


0 


g 



a 


a 


is 


21 


1 


fe 




5 


4 


11 


1 


C 




.2 


4 


16 


1 


a. 




8 




4 


2 


g 


sr 


13 


22 


14 




t 






7 


12 




c 




4 


£ 


12 




a 




i 


3 


1€ 


4 


S 



A 
F 
K 
P 
T 



# 

37 
2? 
20 



M # 

C 14 

0 75 

L 24 

Q 16 

V 23 



D 26 E 



S J 

W 4 



U 
28 



1 1 



WO 



fCT/vsnmsy) 
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Table 13 D; Saj^Ufcgr of a Xsibirary encoded by (MRK)* 
Numbers of hexapeptides in each class 
stal - 64 ,000, OOP stop- free sequences. 




7464960. 
7776000 , 
2799360. 
7776000 . 

§33130. 
3240000 . 
1166400. 
225000. 
810000 . 
145800, 
15625. 
84375, 
30375. 
729. 



-uaa, for example, stands for the set ©f pej 
two amino acids from the « class, two frost *, aM t' 
•>< tere are, for 
entirely of 8„ L, and R. 



having 



WO 92/15679 
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Tables 130; 



plirjg of a Lib; ^ j ic i by (M-tlO 5 
(continued; 



Probability that 

enco<S© a hax&poptid© fro® a stated eiaessu 

p % of class 

U.13B-07) 
(2.25K-07) 
{3.38E-Q7} 
C4.3XB-G7) 
<6.76B-07") 

(9.018-8$) 
U.35B-06) 
{2.03E-O6) 
{3.048-06} 
(X. 808-06) 
(2, TOE -06) 
(4.068-06) 
(6*008-06) 
CS.13B-06) 
{3.6XE-06} 
(S.4XE-06) 
(8 « 118-06) 
U. 228-05) 
(1.83E-05) 
(2.74B-05) 
(7.218-06) 
(1.08E-0S) 
(1.628-05) 
{2,438- OS) 
{3.652-05} 
{5.48B-05} 
{8.21E-05} 





3.364E-Q3 


^mmmx® . , „ 


1.682K-02 




1.514B-02 


« . 


3.S0SE-02 




6.3088-02 


DQjsa'ote, . , 


2.839E-02 


#^#of«a\ . * 


3*8948-02 


**0aaar. . , 


1. 0518-01 




9.4638-02 


0QQ<xctar. » . 


2.8398-02 




2.4348-02 




8.7628-02 


mmtm. « . 


1.183H-01 


♦QDOaa. . . 


7.O97B-02 


fiQQ£Ja« « . , 


X, 5978-02 




8.XX3E-03 




3. 66181-02 




6, 571.® -02 




5.914E-Q2 


. 


2.661E-02 


QSKJOOa... . 


4. 79 OB -03 




X.X27E-03 


#4*M«! * ! 


S.084E-03 


****GG. » . 


1.36SB-Q2 


***00£2. . , 


1. 64 31? -02 


**0QQQ» . . 


1.109B-02 




3.392E-03 


sqqqod , , . 


5.988E-04 
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SO; Samnlinq of a Library «siCoa®S by 1KHK) 6 
" (continued) 

€. number of different stop-free amino -acid sequences 
each, class exacted for various library stzm 

Library sis© - 1*000:02*06 

total «« 9. 744612*05 % sailed - 1.52 



Nuasber % 


3362 


Si 


.-1) 


2.5114 


6 { 




62871 




:S 


38765 


:i 




93672 




2,0) 


24119 


9{ 


1,8} 


1X5915 


5{ 


4,0) 


152 SI 


1{ 


8.7) 


35537 


2( 


53) 


55684 


4C 


11. 5} 


4190 


6{ 


24.0) 


5767 


Q< 


10. 3) 


14581 




21.6} 


3073 


si 


42,2} 



18303.4 ( 
34967 , 8 ( 
2S244.3 t 
104432,2 { 

86442.5! 
68853,51 

7968.11 
83117,5 C 
34325.9 ( 

1087.1 ( 
12637,. 2 ( 

9290 -3 { 
408.41 



1,0) 
1.3} 
3,0} 
2.7) 
5,9) 
3.5) 
7,8} 
16.7} 
7.0} 
.13,0} 
30.6} 
56.0) 



Library sisse •« 



•; .oo o - ;:; • 



2.7885E+06 % sampled 





10078.4 ( 


.3} 




50296.9 ( 




7} 




45190.3 ( 


1,0} 




104432.2 ( 


1 


3} 




187345.5 ( 


200} 




33880.91 


3 


05 




1.15236,61 


2.7} 




309107.9 1 


4 


0) 




275413,9 { 


5,9} 




81392 . 5 { 


8 


?'} 




71074. 5 ( 


5.3) 




252470,2 { 


7 


8} 




334106.2 C 


11.5} 


*8QQ«a. . 


194606.3 { 


16 


7} 


OSSSaa. . . 


41905,9 ( 


24.0} 




23067.81 


10 


3} 


* . 


101097.3 \ 


15.0} 


#**00ff ! * 


174981.01 


21 


€) 


mmm« . , 


148643, 7 { 


30,6} 




61478.3 1 


42 


2) 




9801.0 ( 


56.0) 




3039,81 


19 


S) 


#**«*B . . . 


15537,7 { 


27.7) 




32518. 8 { 


38 


5} 


4M>$QQQ . , < 


34973.8 ( 


51.8} 


<£#0OSO . . 


20215. S{ 


66 


6} 




5S73.9 ( 


80.7} 




667.01 


91 


5} 
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Table 130s Satspling of a Library encoded by <NNR}* 
(continued) 





Library si 


sse * 1 


O0G0B*O7 








5 


total » 


8vl204E*O6 % sasapl^d » 


12 , 69 








ot&aaaa , . , 


33455. S< 


1.1} #0£(X®a?< 


x. . . 166342 


4{ 


2.2} 






148871.1 C 


3,3} ««aeoKK 




7{ 


4.4} 


10 




609987.6( 


6,5) Q&mm&i 


• • 269958 




9.6) 






372371.8 < 


8.6} **Q«a< 


V. . , 9S341S 


l\ 


12.6; 




iSSSSS * I * 


856471, 6 ( 


18.4} SDto 


, , 244761 


Si 


26.2} 






222702 ..©■< 


16.5) 


v. 767692 


Si 


26.7) 






$72324, 6 { 


33.3} lOSte 


531651 


3C 


45.6} 


IS 




104722 , 3 i 


59.9} m^m 


r, . , 68111 


0( 


30.3} 




****Qoe- - - 


2819?€,3 t 


41.8} 


*. 450120 


2 { 


SS.S} 




mQQQa* . . 


342072.1 { 


70.4} $0000, 


*\ . . 122302 


€{ 


83.9} 




OOOOQa.. . 


16364.0 C 


93,5} $#4>4*> 


?• . , . 8028 


oc 


SI. 4} 






37179.9 { 


mom 


i.., 67719 


S{ 


60.3; 


20 


■***ano. . . 


61580.0 C 


9i.»2) mm® 


5... 29586 


1{ 


97.4) 




4H2OQ0Q. . . 


7359. SI 


99,6) OSOSfih 


J* . . 728 


8(100*0) 



Library aizn - 3.0OOOE*O7 
total « 1.8633H+07 % 



29247 
431333 
1712943 
1023590 
2126605 
563952 
20S2433 
163640 
541755 
473377 
17491 
54058 
874S4 
7290 



.4< 3.3) 
.31 9.6) 
,0{ 18.4) 
,0 ( 23.7} 
.0{ 45,6} 
.6{ 41.8} 
.01 70*4} 
*3{ 93.5} 
.7{ 80.3} 
.0( 97.4} 
,3(100.0) 
.X{ 96.1) 
.S{ 99.9) 
.0(100,0) 





. 487990 


0( 


6.5) 




, 983416 


5( 


12.6) 




. 734284 


6{ 


26.2) 




. 2592866 


0( 


33.3) 




. 558519 


0( 


59.9) 




. 1000481 


o( 


55.6; 


<K?QO«a% , 


* 978420 


Bi 


83.9) 




. 148719 




66.1) 


***OOof! . 


. 738960 


a 


91.2} 




, 145189 


7{ 


93.6) 




13829 


1< 


98 .5} 


****Qn. . 


83726 


0( 


99 .2} 


4*0000, . 


30374 


5 (100.05 




729 


0(100.0} 
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Table 130: Sampling of a 



Xiitary encoded by tmft* 
mum 



Library sise * 7, 
total - 3,31258*0? % 



2450S7.8 
1014733-0 
37431X2.0 
2142478.0 
3 86 8785. 0 
1007002.0 
27823SS.0 
174790 .0 
€63929.3 
485353.2 
17496.0 
58234.9 
67500.0 
7290.0 



Library size 



. 8,2) 
{ 22.71 
40.2} 
C 40.85 
( 78. S) 
£ 74, 6 J 
{ 95.4} 
. 99.9} 
t 98.4} 
(100.0) 
(108.0) 
(100.0) 
(100.0) 
(100.0) 

1. 



1175010 
225S380 
IS 04 12 6 
4393247 
840691 
2025063 
X1543S6 
21047S 
808298 
145799 
15559 
$4374 
30375 
729 



.QC 15 
>C{ 29 
.Of S3 
,0( 64 
,9{ 90 
„0( 87 
>0( 99 
.6( 33 
,£>{ ,93 
.9(100 
.9( 03 
,6(100 
.0(100 
,0(100 



,7) 
.0} 
.7) 
.2} 

.15 
.25 
.0) 

:!] 

,0) 
.05 
.05 
.05 
.05 



3.65373+07 % 



318185 


X( 10 


7} 




. 150 6X6X 


0{ 30 


2) 


1284677 


0( 28 


75 




, 3821285 


0( 36 


3} 


4585163 


Of 49 


15 


&®<mmx, . 


. 1783932 


0{ 63 


75 


2586085 


0{ 59 


45 




. S764391 


0( 74 


15 


4051713 


0{ 86 


35 


OQQmxat » . 


. 888584 


3( 95 


35 


1127473 


0( 83 


Sj 


#**0oa. . 


. 3023170 


0( 33 


35 


2865517 


0( 98 


3) 




» 1163743 


0( 39 


S5 


174941 


0 (100 


0) 




. 218888 


S( 97 


3 5 


£7X976 


9( 99 


6} 


***»Ba! . 


, 809757 


3 (100 


0) 


485997 


5 (100 


05 




. 145800 


.0(100 


0 5 


17498 


0(100 






X3613 


S{ 89 


95 


S6248 


9 (100 


: 


****as. ! 


84375 


0(XOO 


0 5 


67500 


0(100 


0 5 


4HM2080.. 


3Q375 


0(100 


0) 


7290 


0(100 


05 


saosso . . 


729 


,0(100 


05 
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'able : Saapli ! I ary encoc 
{ continued) 



Library ; 
total * 



"JMKJQaot. 



QOQOQa. 
4M30SSQO. 



3 .0000B+08 



5 ,26348+07 % &as£3pl$& . 



856451, 
2854291, 
8103426,. 
4030893. 
4654972. 
1343954. 
2315985, 
174960. 
€74999. 
486000. 
1749 6, 
56250 
67500 , 
7290 . 



3C 28.7} 
0C 03,7} 
0C 80.8} 
0( 93.3} 
CM 99.8} 
0( 99, 6} 
0(100.0} 
0(100.0) 
9(100.0} 
0(100.0} 
0 {100.0} 
0(100,05 
0(100.0) 
0(100.0) 



mam® 



mmm 



3668130 
5764391 
26687S3 
7641378 
933018 
3233029 
1166400 
224395 
810000 
14S800 
IS 623 
94,375 
30375 
729 



,0( 49.1) 
,0C 74 a) 
,Qf 9S. 2) 
.0( 98.3) 
.6(100.0} 
.0(100,0) 
.0(100.0) 
.5 (100.0) 
.0 (100.0) 
i 0(100.0} 
.0(100.0) 

.o c loo.o) 

.0(100.0) 
.0(100.0) 



Library size * i.0000E*03 

25 

total »i S.X999S+07 % sampled - 96.87 







. 2018278. 0( 67 


m 




6680917.0 


S3 


5) 






. 4 326519. &( 96 


65 


mmota. , 


7630221. 0( 98 


9) 


30 




. 9320369.01 99 


9) 


QOtoetmix-* > 


2793250.0(100 


0) 






. 4319475.0(100 


0) 


#^Soraa' . . 


7775930.0 


100 


0) 




#8Qaee*a. . 


. 4665600,0(100 


0} 


QQQoraraf . . 


333120.0 


100 


05 






. 1350000.0(100 






3240000.0 


10 Q 


0) 




$4?QQ®0£. , 


. 2916000.0(100 


0} 


&Q£D«s: . . 


1166400.0 


100 




3S 




. 174960.0(100 


0} 




225000.0 


100 






4*M*$S« . . 


, 675000.0(100 


0) 


***0Qa. . 


810000 .0 


100 








, 486000.0(100 


0} 


SfiKiOOa. . 


145 8 00.0 


10 0 








17496.0(100 


05 


« 


IS 60S ,0 


100 






****** . , 


56230.0(100 


0) 




34375.0(100 


0) 


40 




67500.0(100 


0} 


**80BQ. . 


30375.0(100 


0} 






7290.0(100 


0) 


QQQQfiB, . 


729.0(100.0) 
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Table 130s Sampling 



of a Library encoded by 
{ continued) 



Library size * 3*0000®*09 
total » S.3890B+07 % sampled « 



15 



2 884346 
4478800 
0331200 
4320000 



1350000 
29X6000 
174860 
$'75000 
486000 
17496 
56250 
67500 
7290 



.01 9$.$) 
0(100.0} 
,0(100.0) 
.0(100,0) 
.0(100.0} 
.0(100.0} 
0(100.0) 
0(100.0} 
0(100,0} 
0(100,0} 
.0(100.0} 
0(100.0} 
0(100.0} 
0(100.0} 



7456311 
777S090 
2799360 
7776000 
§33120 
3240000 
1166400 
225000 
810000 
145800 
15625 
843 75 
30375 
729 



,01 99.9} 
.0(100,0) 
.0(100.0) 
.0(100.0} 
.0(100.0} 
,0(100.0} 
,0(100.0} 
,0(100,0} 
,0 (100,0) 
.0(100.0} 
.0(100.0} 
.0(100.0} 
.0(100.0} 
,0(100.0} 
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Table 138, continued 
for tabulated quantities. 



10 



31**6 is 31 to sixth power? 6*3 : 

A - hBize/ (31**6) 

0? can be one of f WMFY C I KDEMHQ . ] 

* can b® one of [PTAYGJ 

0 can be one of CSLR] 

FO - <22)**6 FX - U2)**S 

F3 *> (X2)**3 F4 * M2}**2 



{12)**4 
{12} 



30 



* FO * {l-exp<«A)J 
*. 6 * 5 * FX * fl-ex»{ -2*A) } 
» « * 3 * Fl * (X-exp(»3*A) ) 
» {IS.) * 5**2 * F2 * (l~exp(-4*A)} 

ftOttttOtt » C6*5)*5*3 *F2 * <l-*x»f-«*A> ) 
QQcuam » (IS) * 3**2 * F2 * (l~Ma>(-»*A}l 
®s®am * (20)* (5**3) * F3 * ii*m (-&*&} ) 
§>*tea * (60)* (5*5*3) *F3* U~exp{'»X2*A) ) 

- CS0} * (S*3*3) *F3* (X-exp C ~2S*A) ) 
OOOomm * ( 2 0 ) * { 3 ) * * 3 * F3 * ( X - «xp (-27 *A } ) 
m®$®& m (15) * (5) *M*F4* (I-expM6*A) ) 
««*Qoa * (60) * (5) **3*3*F4* (i-exp ( -24*A) ) 
OKKtea « (90) * (5*5*3*3) *F4* £X~exp{ ~3S*A1 ) 
SQQQosx « (SO) * (5*3*3*3) *P4* (1-exp ( -54*A) i 
WW®® - (1S)*(3}**4 * F4 *(X-exp(~8X*A)} 

* <«> * (5) **5 * FS * (i>exp{~32*A)! 
****G<* » 30*5*5*5*5*3*F5* CX-e3CB{~48*A> S 
***S0« * SQ*S*S*5*3*3*FS*(l-exp(~72*A5 ) 

- - 6C*S*S*3*3*3*FS* (X-exp(~l08*An 
$DQSOa » 30*5*3*3*3*3*F5*{l-exp{~262*A) 5 
000000! » 6*3*3*3*3*3*F5* (l~«acp{-243*A) ) 
****** « 5**6 * U-e»p{-64*A)'> 
*****n - 6*3*5**5* (l~exp{-SS*m> 
****00 - 15*3*3 * 5**4 .* { 1 - axo ( - X44*A) ) 
***GOO « 20*3**3*5**3* (X~®3Sp{~2X6*A) ) 
3 S 280 - 15*3**4*5**2* (X-ex»(-324*A) > 
♦QOOSO « € "* 3 * *S *S * ( X - ex» ( * 485 *A J ) 
080080 « 3**€* (I-S.xpi--729*A) J 
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Table 13 X 



10 



X can be F^^aX^H^I.T^V^DjG 
r can be L* , ,S,W,$ 

Library comprises 8 , SS -10* ffliaa-acM s«queac«s ? 1. 47**0* I 
i sequences* 8,555, $25 




rha first, second, fifth, and s * s s can hoi J 
8? the third and fourth posits >ld 8 or 0. I 

toy the mwfcer of xs t Ss, 0s, art Qfl. 




The following 
particular DN& seqts 



table shows the likelihood that a • 



will fall into one of the 



30 



1.0 



library sisse - 

total........ 1.0000B+00 

.... 3.1S24E-QX 

.... 4.I684K-02 

... ... 1.310XB-OX 

, ... 3, 8600 si- 02 

.... 5.1042E-03 

.... 2.6.73SS-03 

, ... 1.3129E-04 

.... X.73SXE-05 



« .00001% 

1.16S8E-07 
2.2926E-01 
1.80.13S-01 
2.3819E-G2 
X3t68SS ....... 2, 8073 E - 0.3 

3.6?«2E«03 
4.MXXB-04 
S.S485K-05 
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...Jl: Saaplirwr of a Library 
Encoded by jt-?nt}* iNNG) * 
{coxitis 



following 
sxpaeted 



for libraries of 



sasv sequences o 
different sizes. 



Library eisse 




30 



3S 





9 . 9X378*04 



.7) 
2.7} 
2.7} 



Library size 



3X4X6.9 ( 
4112.4 { 
12924.6 C 
3808, 1£ 2.75 
483»7 ( 10.3} 
253.4 ( 10.3) 
12.4 { 10,3} 
1.4( 35.2) 

!»000pB*O6 



fraction sancpled ~ 2.1587E-G2 



No; •: % 


22771,4 { 




17891,8 ( 




2318.5 { 




2732.5 { 




357. e( 




43. 7 { 


::. 3*5} 


8.6{ 


19,5} 



Library aize 



fraction s&srpl&d - i.07SiE~oi 

304783,9 C 8.6) XJCSQacx. . , . . 214394. G{ 12.7 

36508*6 C 23.8) X5e883?S 168452.51 12.7 

114741 , 4. £ 23 » 8) aocOfixS . . . > . 16383 . 8 { 41. 9 

33807. 7C23. B) MC808S. .... 21666, 6$ 4.1,9 

3114. 6 { 86.2} 3C388SS. . . 2837.3 ( 41.9 

1631. SC 66.2) XSOOSS,.,. . 198.4 ( 88.6 

80.1 { 66.2) SS8QSS. 38.0 £ 88,6 

3.8£ £8.7) 

3.O0P0E+O6 

2.3880B+06 fraction sampled »• 2. 79121-01 



5 ' D3.S( 18.4) 
85564.7* 55.7} 
2689X7. 8< 55.7) 
79234. 7 ( 55,7} 
4522.61 96.1} 
2369.0 C 96.1} 
118. 3 ( 96.1} 
4.0 C 1O0.O) 



565851.6 { 33.4 
443968.1 { 33.4 
39281,31 80.4 
41581.5 C 80,4 
5445.2 C 80,4 
223.7 ( 99,9 
43.9 ( 99.9 
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Ta&l® 131: Sampling of 
Encoded by {fcWtt 
(continued) 



I*llbr&rY 



Libra it ssise - 8.S556B*06 

total ...... 93Q3E+0S fraction 

xxOSsac. .... 2046301 . 0 < 44 . 0 ) 

k» . 1385 7 5 . 9 ( .9 0 . 2 } xx9 9xS 

scxOQxS.. . - . 43SS24.3 ( 30,2} 

DCS SS. 128324.1 ( 90.2} 

> i < QSS. 4703. € {100.0} 

xS 2463,8(100.0} 

S3..... 121.0(100.0} 

ssoass.,*.. 4.0 {100.0} 

Library sisse - x .00001+07 

total ...... 5. 3687B+06 fraction 

2289093. 0{ 49.2} xxOOxx 

. . 143467.0 f 93.4} xxOSxS 

. . 450896.3 < 83 ,4} xxBOxS 

132SS3.4C 83.4} xxSQSS 

4703,9(100.0) XS89SS 
2464,0(100.0) 
121.0(100.0} 
4.0(100.0) 

Library sissa * 3.0000S+07 

total ...... 7. 896XS+06 

.. 4040589. 0( 85.9} 
.» 2.53819.1(100.0) 
. . 482802.9(100.0) 
142254,4(100.0 
4704.8(100.0) 
2464.0(100.0) 
121.0(100.0} 
4.0(100.0) 




5.7626 01 
11S064S.0C 68.7} 
911935.6 ( 68.7} 
434B0.7C 99.0) 
51245.1 ( 93.0) 
6710. 7 ( 99.0) 
224.0(100.0) 
44.0(100.0) 



S.2727E-01 
1254877.0 ( 74.2} 
985974.9 ( 74.2} 
43710.7 ( 99.6} 
51516.1 C 99.6} 
6746.2 { 99.6} 
224.0 (100.0} 
44.0(100.0} 



9. 229 IE -01 
1661403 . 0 ( 98.3} 
1305393. 0( 98.3} 
43904.0(100.0} 
'51744. 0(100.0) 
6776.0(100.0} 
324.0(100.0) 
44.0(100.0) 



wo nnsm rcrmsnm $m 
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Tabla 231; Sm&lim Of a Library 
Encoded by ($FT)* ?M?G) 3 

S Library sis® - 5*60001+07 

total., 8.3S56B+06 fraction sailed - S.8130B-01 

xxfiSxx. .... 4491779.0 ( 96, 6) xxSQsoe, . . . . 1688387. 0{ 99, 9} 

xxQSsoc, < , < . 1 53663 . S (100 , 0 ) xkOOkS ..... 1326590 , 0 { 99.9} 

10 xxOQxS 482043.4(100.0} xxQQxB . , . , . 43004.0(100.0} 

3OCS0SS ..... 142285 . 8 ( 100 . 0} XX&8SS ..... 0X744 . 0 ( 1 00.0 } 

xxQQSS. .... 4704,0 (100.01 xSSBSS. .... 6776 .. 0 (100 . 9} 

XS8Q8S..... 2464.0(100.0) xSDQSS . 224.0(100.0} 

8S88SS . 12X.OC100.0) SS0QSS . . . . . 44.0(100.0} 

15 SSQ08S . . . . . 4.0 C 100 , 0 ) 

Library size » 1.0000E+08 

total. ...... 8.55Q3B+06 fraction sampled * 9.S938B-01 

20 rxb-lo: 4.643063.0 C 99.9} xxSOxx. . , . . 1690302,0(100.0! 

XXOQKX. . . . . 153604,0(100.0} XXBBXS . . , , , 1328094,0(100.0} 

xxOQxS 482944.0(100.0) xxDQxS..... 43904,0(100.0} 

xx89SS 142296.0(100.0) aacSOSS. , . . . 51744,0(100.0} 

aacCaSS 4704.0(100.0) xSSeSS..... 6776,0(100.0) 

25 XS0Q5S 2464,0 (100.0) xSOQSS 224,0(100,0) 

SS60SS 121.0(100.0) SS0OS8...*, 44,0(100.0) 

6S0QSS...... 4,0(100.0) 
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Table 132 s Relative 



:ive efficiencies of 
variegation eodona 



s 






afeer of coc 


Ions 






.,.,„ £ ■ 


„„JL 


% 


























ISM] 






10 






13.96 
















stops van 


ish (3, a- 10 s ) 


<8\4*1Q 7 ) 








1.38 


1,47 


1.57 


IS 




E1.05-1OS3 


£1.68 '10*] 


[2.88*10*] 






(7. 59* 10 s ) 


£lvl4-10 T ) 


{1,71*10*) 



2,04 2.36 2,72 

[7.S9«l.iVl |1.14* 10 s ! 11.71*10*3 
(4.83 -10*) C6.27-X0?) 



wo vmsm 
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Tab!® 148, Effect Of anti BPTI XgQ en phage 



10 



IS 



Input *.&nU.-BPTI h& f K3.ur.ed 
SXX&iXL , t£rPt^in,A{ g Q 




Table 141. S££«ot of anti-SOT ©r protein A on pfct&g® titer. 

Ho +Ant i ~ +&&t i - 

Strain Input Addition BPTX Protein A Br?r 



100(b) 107 105 72 65 

mim £2 uur! m 



Protein &~&gaxo«e beads 

Percentage of input phage measured as plaque 
forming mxits 
Batch number S 



WO 92/15679 



s 


Strain 


input 


BPTI 


+KRS 
(a) 


a>) 


+m8 




MX3MPX8 


100 {cj 








8S 




M13M848 m 


100 


30 


125 


13 


121 


10 




AQSL 


3L 


193. , 


„ Qt? 


...ii a.... 




as plague 



Table 143 * 





AOhydirofcrypsin 


Streptavidln 










Post 




ai^fc - 




££&X£ ™...iss 


i&afcisa 


M13MP18 


100 (a) 


121 




Hp 


M13MB48 


100 


58 


10:0 


98 


5M Pool 


100 


44 


100 


93 



30 



wo n/tsm 
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Table 144, Blading of I&soXay 3?&aga to Jmhy&ro trypsin, 
Experiment 1, 

lage (:%) Relative to 

8H.3MP18 

0.2 (a) 1*0 
7.9 39, S 

±1,2 5S<Q 



10 

Strain Eluted Phage fa} Relative to 



0.3 2U0 
12.0 40.0 
17.0 56,7 



(a) Plaque forming units acid elated from beads 
as a percentage of the input. 

T&bl® 145, Blading of Bijinlay P&&f« to *atoyarotryp*i» &r 



Strain.... 



mis™ 



Pfaacre Relative 



| 0,1 X j 2,3x10-* 1.0 

BPTI-IXIMK i 0,1 $1 j 1-17 5X103 

MX3.33E7 | 25. 0 250 j 1.4 6x.X0 s 

30 M13.3X11 | 3,2 32 \ 0.27 l,2x!0 3 

Ca) Plaque forming units sluted from beads, esspressed as a 

percentage of the input , 

(to) Relative to the non- display phage, M13MPXS, 

3S 



wo mmm 
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......... MLMl 


s 




JftL- 


mge Relative 
....I Bindxn^ifeL 


Elufcsd Relative 




K13M?18 




1 


3x10'* 








Xj 1.0 


2000 


SxlO-' 3 


U.7 


10 




0.13 


260 


9x3.0** 


30.0 


: 


&S13.33C7 


1.15 


23O0 


^ 


3.3 




H13 , 3X11 


0,8 


1600 




6V7 




SPTX3 . CI; 


IxlO' 3 


2 I 


4.1 


1.45SX 0* 



ic} 



(a) Plaque foraiag units acid eluted from thB beads, 

©stressed as a percentage of input. 

{h} Relative to the non-display -phage, pft3$P3&* 

(C) BPTl-IIXMK {KISh 



WO 92/15679 



PCT/US92/0I539 



147 

t«*»i* 8*0*: smmtm^iMM^mumm 

Putativfe StrepfcavMiaa 

JDEV{P) A B - F C H P Q Y E L C 0 X f H Q J P P P P P S S.», 
PevCI) A 8 - h C B ? 0 F ? E C M L F R K V P P P P P P A B. » . 

HPQS A E G P &.,&,,,£...£...£...£...£■■■■& Y I E G R X V E-«-.-. 

llllUXl 1 1 2 2 2 2 2 2 2 
1 2 3 1 S U S H 1 2 3 4 S 6 7 8 S 0 1 2 3 4 S 6 
~ ~ - - C — - - - , C « B 



WO 92/15679 



PCTAJS92/01539 




dtp faflg) R&la^^slt^d^. 

0 1.00 

2 0.S5 

10 i-oo 

20 



92/15679 



FCT/i;S9.2/()lS39 



-M UL 



1abl» Effect of tlm pr«wee of 

Bit tfa« binding of display phag* to 



Inputs: MOT 4.2 X iQ 3i , HPQS 3.3 X 10». 



— m Mnatog 

nra o 4.8 x io-* i. go 

2, 000 5.4 x W* 1.10 

10. < 000 5.2 x 10"* 1.10 

HPQS 0 l.S x 1.00 
1 l.S X 10* 1.00 
10 l.S x W 4 0,90 
100 9.7 x 10 vS 0,60 
1000 1„$ X W 4 0,10 
2,000 1.0 X W s 0,06 
— <M X *%!!_■ OtP$ 



mm/mm 



ISO 



Titer after 



Relative 



Relative 



3.3 x lQ li 



1.4 x 10** 



833 s F3£* 
jfollowiog- 



of display ; 

to st^&pt&vidisu 



Fraction 
Elufced 



7*6 x iff* 



l.« x 10' 3 
,JL£..K.. *0lL 



by 



14 
-JUCL 



of IP06 display ; 



Added 


Time 

.......mm) 




0 




l? 


2.5 




21 


6.3 




22 


12 rS 




35 


0 




3LS 


2.5 




S3 






54 






52 ....„„ 



WO 92/15679 



*»CT/US9 J/01539 



CK0U74 ; 
C0LM87 ! 



0; 

BEL&88; 
BSVL90; 
DICKS 3 ; 



mtth®m, ana {1985} Protein 

Sng inhering s 387-91, 

Baxmer, TM f C gfave, m Marvin, Mature U98i) , 
Mi;8X4~8IS\ 

Becker, S, I Afchfcrten,. and RD Gordon, Bur J Biochem, 
(Oct 20 1989}, Ifi&aj 79-84. 

Berg, 0M, Proc Natl Acad Sci tESA (198$) , IS 5 99-102. 



, J' Mol Biol (1980) , 



(X988) , aMilS4i-1043 . 
JD, M RusseX, and P 3 
2M;103~XX8\ 

Chang, C&, P Model, end 6 BXobel , Proe Natl tod Sci 
VfBA i 19 79 } , 2&i 12S1 - 1255. 

Chou, PY, and GD M«a, Biochemistry (1974) , 
(2) 232*45. 

MmQStm&M...M&.3^mlSmi& t Second Edition, Baiters 
Caiman, Hlreh, Marder, end Sfcltma* Published by 
Pippiacott, PhiXadelphia, PA, 1987, ISBN 0-397- 
50679-1. 

Creightoa, m f mtmmi^Mmsmsm aa&..^lgffiU&g 

Principles,, W E Freeman & Co, New York, 1984. 
CwirXa, SB, EA Peter®, m Barrett, and war Dower, 
Proc Natl tod Sci USA, {&»gust 18905 , 52s 6 3 78 -63 82. 
da la Cms, ¥F, a& Lai and TF MeCutdfcstn, 
J Biol Cham, (1988), 2HC9) 43X8-22. 
Devlin, JJ, LC Panganiban, and PB Devlin, Science, 
(27 July 1990), 249 i 404-408. 

Bic&erson, RE, and X Geia, mmsl^hm^J^xmSMm^ 
B&B£&i£^^ The 
Bejamin/Cu^ihg® Psa&lisbing Co, MenXo Park, ca, 
1983. 

Dttlb«CCO, H, X7S Patent 4,5.93,002, June 3, 1986. 



vmmtsm 



152 



VSQK19 t 
J&N&89 t 

jausxsss 

K&PL78 t 



Oauss, F, KB Krassa, J3S McPneetsrs, m. NeXson, and L 
Gold, Proc mtX tod Sci USA (198?) , M*8SX5~19- 
Gibson, TJ, JPM Fostsaa, SS Brown, and P Argos, 
Protein Engineering {1988} f 2(3)209-218. 
Ousn, XL and M:xon 5B , Ite&X. Blocheaa, (1991} .. 212" 

Hard, T s B KeXlesfoach, » Boelan*, BA Maler, K 
Danisms., LP Freedman, J Carlstedt - Duke , XR Yamaaoto, 
j>A Gnetarsson, and It K&ptein, Science (13 July 
1990} , 343:3.57-60. 

Korvst, 9, B Grgaa, M Raos, and VI Simeon, Xnt J 
Peptide Protein Res (1989) , M^46-5X. 
^ ( H, W Barnes, and J Beclcwith, J Baeterioi 
(1982) , lil. (2) 434-439. 

Xto, X, G M&ndftl, end W wickner, Proc Hatl Acad Sol 
TXS* (X97S), 2&;XX99~XR03. 

Janatova, J, W Eeid, and AC Willis, Biocnem 
U933), 3ft*47S4~fiJU 

jaain, or, and C Chothia, Methods in Btosyxaology 
(1985) , 115 (23) 420- 430. 

Kaplan, BA, Srsenfield, and G Wilcox, in Silt-. 
SMsMMmtoSI. Denhardt, m, D 
Dressier, and BS Hay editors, Cold Spring Harbor 
Laboratory, X978., »46X~4S7. 

Eatz, B, and AA Koesiakoff , Proteins, Struct, Fuact, 
and Ssset (1990) , 2*3*3 -57. 

Kim, et al. ; Protein Engineering (1989), %{!}>. 379- 



R, and P Baiaram, 
24^041.-43. 



WO 92/15679 



PCT/US92/61539 



Kuha, A, and W Metaer, J Biol Cham (1985) , 
M£s 13914-15918, 

Zxihn, &, and W wickner, J Biol Cham (1985) , 

1S9 07. 159.13.. 
Ruto, A, Science {1987} , 21&J 1413 -1415. 
Lias, LE, BL Johnson* and DB Oliver, J Bacterial 
{198$} , l£i (2)925-8. 

Lopes, j, aad RE Webster, J Bacteriol (198S) , 
jlfia {3} 1270-4. 



Bay, J Virology , { 1985 > , M U ) 2 68 - 



m f mis 

nm mm, gmtm^i 

lit BLB Caspar, and PA 
{ IS 80 } < 1M? 149 - 181 . 
Maniafcis, T, EW Fritseh, and J 

Cold Spring Harbor Laboratory* 1982. 



2 76. 



Marks,. CB, M Vasi«r, P Kg, W Hansel,- and & toderson, 
J Biol Chem (1986)., 2£X; 7113-7118. 
Marvin, BA„ in T&S §tel^-^TOd^Sm..Bi^E, 
Bahhardt, DT, B Brassier, and BS Ray editors,. Cold 
Spring Harbor Laboratory, 1978.,, pS83-603. 

M, m Becktel, K Levitt, and BW Hat thews,. 
Proa Natl Acad Soi USA. {1989},. M!$562~S, 
McCaf forty, J f AD Griffiths, G Winter,, and m 
Chiswell, Nature, CS Dec 1990} , Mfi:SS2-4. 
Massing, jr, 8 Gronenborn, S Muller-Hill, and 
PH Kof schneidar, Proc Natl Acad Sci BSA {1977} f 

g, J, and S Gronenborn, in Si^ailHlt^txaMM 



_ jBssibsrat, &£, -P Presslsr, and PS Kay 
editors, Cold Spring HarBor laboratory, 197S* f p449» 
453. 

H2CS8 s s RicholsKS*, H, W fecktel, and SW Mat thews, Nature 

(1988) , 33St 651-56. 
j?XSHB2 ; Kislaluchi, Y, and S Sakakifoarsa, FBBS Lett (1982} f 

KISH86i Mshiuchi, X, K temagaya, Y »oda, TX Watanabe, and S 

Sakaklbara, Blopolymers, 11986} , 2£jS6X-8» 
mMBXt Ghkam, I, and SB Webstar, J Biol Chess (1981) , 

OMWOa: OXivara, SM, J Rivier, C Clark, Ca Kamilo, GP 
Corpus, m Abogadie, SB Mena, B& Woodward, m 
Hillyard, LJ Cruz, Science, (20 July 19$ 0) , M&a57- 
263. 

p&BQ?9s Pabo, CO, ZT Sauer> m Stnrtewnt* and M Pta&bne, 

Proc Hat! Acad Sci USA (1979) , 2fis 1608- 1612, 
FBRMSSj Parsley, SP, arid OP Smith, Gene £1988), 23^ 3 05 -3 IS, 
PEEKS 4 ; Ferry, L-J, and R Wetzel, Science {1984}, 2at£*S55-7. 
PERES 6 ; Pew, LOT, and R Wetzel, Bioeaem (1986), 21:733-39. 
p 0 TSS3t Pocaete, AR f J Sfol Biol (19833, 121*401-4X8, 
RmCBGi Rascbed, X, and E Oberer, Microbiol Rev {1986} 
£&t 4 01-427, 

{13841, a2.;S518. 
in SSnssymoXolgy (1985), X1&C29) 430- 

440, 

Bushel, U t and F Model, Proc Natl Acad Sci USA 
{1981) , 7.8:1.717-1721. 

Rassel and Model, Cell (1982}, 21U3 ' 177-84. 
Sali, D, M Bycroft, and M. Far slit, Nature C198S) , 

aa&t 740*3. 

Saafcrook, J, m Fritseh, and T Mania tis, miXasssXSX. 
iMaOiMX**. second Edition, Cold 
Spring Harbor Laboratory, 1989. 



WO 92/15679 



PCI7US92/01559 



155 



SCOTS 7a; Scott 



WELLS S i 



Sauer, RT, I mr.au, MR Weiss, 

A Jaitier-wilsson, BCS Sncbanak, and CO Fafoo r Siaoheis 
tX986) , a& J 3992 -98. 

Scfcaller, K f 1 Beck, and M Takanami, in Tn&JSlftff.l&~ 
Stranded PNA Phages. Benhardt, D.T, , D, Dressier, 
and &.S.. &ay editors, Cold Spring Harbor laboratory, 
1978*, £3,39-163, 

Schaabel, B, w Schroedar, and 0- Reinhardt, Biol Chess 
Hoppe-Seyler mm) > 1SI:XX€?-7S , 

CS Huckaby, 1 Kato, WJ Kohr, M Las&©*ski 
Jr., M»J Tsai and BW O'Malley, J Biol Cheat (if 87} , 
Ml (12) 5 $m -5907. 

Scott, JK, and GP Smith, Science, (2? July X990) , 
2MJ386-390. 

Sstith £?B, Science (1885), 22Sj 1315-1317. 
Suaaaers, J- Cell Biochers, {1891} , i 41-8, 

Vita, C, D D&Xsopps, and A Fontana, Biochemistry 
(1984), 22sS512«SSX8. 

Webster, EE, and CTS Cashman, in $te..&imlteMX&1&!&. 
pm. gh.a?es . Dehhardt, DT, D Dressier, and DS Ray 
editors, Cold Spring Harbor Laboratory, 1978., p557- 

wells, ja, and DB Powers, J Biol Cham C198S) , 

Yaniseh-Parron, C, <J Yiaira, and J Messing, Cane, 
C198S) , M; 103-119. 



Z&faraila, GC, C Ramilo, WE Gray, R Karlstrom, 
Olivera, and LJ Cms, Biochemistry, {1988) , 
M USj 7103-5. 

Zisroenrann, &, C Watts, end w Wlekner, J Biol C 



PCT/US92/01539 



1. In a process for developing .novel epitopes or 
Binding proteins with a desired binding activity against a / 
particular target material -«ldch cos^rises providing a 

5 library of phage wMeh each displays on its surface, as a * 
result of expression of a first phage gene, one or mora 
copies of a particular chimeric coat protein,- each chimeric 
coat protein comprising a potential epitope* or a potential 
binding domain •which is a mutant of a known protein domain 

10 foreign to said phage, said library collectively displaying 
a plurality of potential epitopes or binding domains,, 
contacting said library of phage with the target .material, 
and separating the phage on the basis of their affinity for 
the target material, the improvement wherein eaid chimeric 

15 coat protein further comprises a linker peptide which is 
specifically cleavahle by eaid site-specific protease* 

2. The method of claim l wherein the site- 
specific protease is Factor Xa, Factor Kia, kallikrein, 
thrombin. Factor rila, eollagenase or entarokinase. 

2Q 3 - The method of claim l wherein, after said 

literary of phage is contacted with said target material, 
(13 low affinity phage are removed, (2) high aff inity phage 
still bound to said target material are released by 
cleavage of said chimeric coat protein at said linker by 

25 means of a site- specific protease, and the released high 
affinity phage are recovered. 

4, la a process for developing novel binding 
peptides or proteins with a desired binding activity 
against a particular target material which comprises 

30 providing a library of phage which each displays on its 
surface, as a result of expression of a first phage gene, 
one or store copies of a particular chimeric coat protein, # 
each chimeric coat protein comprising a mutant of a known 
protein domain foreign to said phage, said library 
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collectively displaying & plurality of potential blading 
domains .. contacting said library of phage with the target, 
saaterial, and separating the phage on the basis of their 
affinity for the target material, the improvement wherein 
5 said said potential binding domain has at least one 

intraehain covalant crosslink, between a first amino acid 
position and a second amino acid position thereof, the 
amino acids at said first and second positions being 
invariant in ail of the chimeric proteins displayed by said 

10 library, and where low affinity phage are removed from said 
target material first, and then high affinity phage are 
released or rendered more readily eluted from the target 
material by treating the phage with a reagent which cleaves 
the crosslink, preferably a reagent which does not Mil the 

IS phage . 

5, The method of any of claims 1-4 wherein the 
domain is a mini-protein of less; than sixty amino acids, 
more preferably a micro-protein of lees than forty amid© 
acids. 

20 6. The method of claim 4 wherein the crosslink 

is a disulfide bond and the amino acids at said first and 
second positions ere cysteines* 

7. The method of claim S wherein the reagent is 
dithiothreitoi , 

25 8. The method of any of claims 1-7 wherein the 

phage further comprises a second phage gene encoding the 
cognate wild- type coat protein of the phage, 

9, In a process for developing novel epitopes 
with a desired binding activity against a particular target 

30 material which comprises providing a library of phage which 
each displays on its surface, as a result of expression of 
a first phage gene, one or more copies of a particular 
chimeric coat protein, each chimeric coat protein 
comprising a potential epitope, said library collectively 
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displaying & plurality of potential epitopes, contacting 
said library of phage with the target material,- and 
separating the phage on the has is of their affinity for the 
target material, the improvement wherein the phage further 
S comprises a second phage gene encoding the cognate wild- 
type coat protein of the phage. 

10. is a process for developing novel epitopes 
or binding proteins with a desired binding activity against, 
a particular target material which comprises providing a 

10 library of phage which each displays on its surface, as a 
result of expression of a first phage gens, one or more 
copies of a particular chimeric coat protein, each chimeric 
coat protein comprising a potential epitope, or a potential 
binding domain which is a mutant of a known protein domain 

15 foreign to eaid phage, said library collectively displaying 
a plurality of potential epitopes or binding domains, 
contacting said library of phage with the target material, 
and separating the phage on the basis of their affinity for 
the target material , the improvement wherein the chimeric 

20 coat protein includes only an aasemfelable fragment of a 
coat protein of said phage, ana not that portion of the 
coat protein which is responsible for pilos binding,, and 
the phage also comprises a second phage gene encoding the 
cognate native coat protein of the phage, 

25 xi- In a process for developing novel epitopes 

with a desired binding activity against a particular target 
material which comprises providing a library.. of phage which 
each displays on its surface, as a result of expression of 
a first phage gene, one or more copies of a particular 

30 chimeric coat protein, each chimeric coat protein 

comprising a potential epitope, said library collectively 
displaying a plurality of potential epitopes, contacting 
said library of phage with the target material, and 
separating the phage on the basis of their affinity for the 
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j st material, the improvement wherein the cognate wild- 
type coat protein of the phage is the major coat protein of 
the phage 

12, The method of clalMs s-xa. whereis the 

5 initiation codou of the second phage gene is a leucine, 

13 , The method of any of claims i ~ 12 wherein the 
first phage gene further comprises a cytoplasmic secretion 
signal sequence which codes for a signal peptide which 
directs the immediate expression product to the inner 

10 membrane of the bacterial host ceil infected by said phage,, 
where it is processed to remove said signal peptide, 
yielding a mature chimeric coat protein comprising the 
potential binding domain and at least a portion of a 
geneVXXI-iike protein of the phage , said chimeric protein 

15 being assembled with wild-type coat protein into the phage 
coat, wherein the secretion signal is encoded by a signal 
sequence selected from the group consisting of the signal 
sequences of the j&La. and ffenelXX genes, 

14 , In a process for developing novel epitopes 
20 or binding proteins with a desired binding activity against 

a particular target material which comprises providing a 
library of phage which each displays on its surface, as a 
result of expression of a first phage gene, one or more 
copies of a particular chimeric coat protein, each chimeric 

23 coat protein comprising a potential epitope, or a potential 
binding domain which is a mutant of a known protein domain 
foreign to said phage, said library collectively displaying 
a plurality of potential epitopes or binding domains, 
contacting said library of phage with the target material, 

30 and separating the phage on the basis of their affinity fox* 
the target material f the imj s > v ge also 

i phage gens e- 1 -- the cognate native 

coat protein of the phage, and the initiation eodon of the 
second phage gene is a Leucine eodon. 
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15. method of any of claims 1-14 herein, the 
differentiation among said plurality of different potential 
binding domains occurs through the -«t least partially 
random variation of one or war& predetermined amino acid 
5 positions of said known domain to randomly obtain at each 
said position an amino acid belonging to a predetermined 
set of two or more amino acids, the amino acids of said set 
occurring at said position in predetermined expected 
proportions . 

10 16, The method of claim 15 wherein the differentiation 

among said potential binding domains of said library is limited to 
no more than about 20 predetermined amino acid residues of said 
sequence* 

171 The method of claim 15 wherein, for each set, the 
IS ratio of the probability of occurrence of the most favored amino 
acid to that for the least favored amino acid is less than aboout 
2,6. 



18, The method of any of claims 1-17 wherein, for any 
20 potentially encoded potential binding domain, the probability that 

it will be displayed by at least one package in said population is 
at least 50%, more preferably at least 80%, 

19, The method of any of claims 1-18 wherein said 
population is characterised by the display of at least IS* 

2S di s ^ ~ potential binding domains. 

20, The method of any of claims 1-19 -wherein the 
initially chosen parental potential binding domain is selected from 
the group consisting of (a) binding domains of bovine pancreatic 
trypsin inhibitor, cramhin, cacurbita Lm&iiM. trypsin inhibitor III, 

30 a heat-stable enterotoxin of l&chsrickja fifili, an alpha-, mx~ or 
omega- conotoxin, apamin, charybdotomn, secretory leukocyte 
protease inhibitor, cystatta, eglin, barley protease inhibitor, 
ovomucoid, T4 iysotyme, hen egg white lysozyxae, ribonuclease , 
asurin, tumor necrosis factor, and CD4, and (h> domains at least 
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substantially homologous with any of w.x^, tax w 3 

have a melting point of at least; 5"0*Cv 

21. In a process for developing novel epitopes with a 
desired affinity for a particular binding protein target material 

S which comprises providing a library of phage which each displays on 
its surface, as a result of expression of a first phage gene, one 
or more copies of a particular chimeric coat protein, each chimeric 
coat protein comprising a potential epitope, or a potential binding 
.domain which is a mutant of a known protein domain foreign to said 

10 phage, said library collectively displaying a plurality of 

potential epitopes or binding domains, contacting said library of 
phage with the target material, and separating the phage on the 
basis of their affinity for the binding protein target material, 
the improvement wherein the differentiation among said plurality 

15 of different potential binding domains occurs through the at least 
partially random variation of one or more predetermined amino acid 
positions '.at said known domain to randomly obtain at each said 
position an amino acid belonging to a predetermined set of two or 
more amino acids, the amino acids of said set occurring at said 

.20 position in predetermined expected proportions, and in 

substantially all sets the ratio of the frequency of occurrence of 
the most favored amino acid to that for the least favored amino 
acid is less than 2.6. 

22. The method of claim 21 in which at least one 

25 variable amino acid position is encoded by a simply variegated 
codon selected from the group consisting of mr f wm, HMG.< 
YM\. m$, and SK'Xt 

23. The method of claim 21 wherein none of the variable 
amino acid positions is encoded by a simply variegated codon 

30 selected from the group consisting of knn, mm and W®S. 

24. The method of claim 21 in which at least one 
variable amino acid position is encoded by a complexly variegated 
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25. A library of display phage wMch each displays on 
its surface, as a result of agression, of a first phage gene, one 
or more copies of a particular chimeric coat protein, each chimeric 
coat protein comprising a potential epitope, or a potential binding 
S domain which is a mutant of a known protein domain foreign to said 
phage, said library collectively displaying a plurality of 
potential epitopes or .binding domains, contacting said library of 
phage with the target issaterial* &nd separating the phage on the 
basis of their affinity for the blading protein target mtarial, 

10 wherein the differentiation among said plurality of different 
potential binding domains occurs through the at least partially 
random variation of one or more predetermined amino acid positions 
of said known domain to randomly c&fcaln at each sain position an 
amino acid belonging to a predetermined set of two or more amino 

IS acids, the amino acids of said set occurring at said position in 
predetermined expected proportions, and in substantially all sets 
the ratio of the f raquency of occurrence of the most favored amino 
acid to that for the least favored amino acid is less than 2.&. 

,26. A library of display phage which each displays on 

20 its surface, as a result of expression of a first phage gene, one 
or more copies of a particular chimeric coat protein, each chimeric 
coat protein comprising a potential epitope, or a potential binding 
domain which is a mutant of a known protein domain foreign to said 
phage, said library collectively displaying a plurality of 

25 potential epitopes or bindiag domains, contacting said library of 
phage with the target material, and separating the phage on the 
basis of their affinity for the target material, wherein said 
chimeric coat protein further comprises a linker peptide which is 
specifically eleavabX® by said site-specific protease. 
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